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Abstract 


Uusor II le the second program that has been developed to tutor 
students in the game of Uumpus. From the earlier efforts uith Uusor I it 
was possible to produce a rule-based expert which possessed a relatively 
complete mastery of the game. Uusor II endeavours to teach the knowledge 
embodied in the rules used by the Expert. The Student Model represents 
Uusor s estimation of the student’s knowledge of said rules, and this 
estimation is based primarily on analyses of the player’s moves. The 
Student Model allows Uusor to personalize Its explanations to the student 
according to the student’s current knowledge of the game. The result ie 
a system which, according to preliminary results, is highly effective at 
tutoring students of varied abilities. 


This report describes research done at the Artificial Intelligence 
Laboratory of the Massachusetts Institute of Technology. It was 
supported In part by the Division for Study and Research in Education, 
Massachusetts Institute of Technology and has been submitted to the 
Department of Mathematics in partial fulfillment of the requirements for 
a naster of Science Degree under the supervision of Professor Ira 
Goldstein of the Department of Electrical Engineering/ Computer Science. 
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The Intnoduct ton 
Chap ter' I 


One of the purposes of our educational system is to teach better 
ways to think about problems in general. This is a key justification for 
the study of higher mathematics and the sciences by the general student. 
Similarly, games can be an ideal teaching instrument. Most games 
represent some facet of life but in a much simpler domain, thereby 
allowing the student to develop his problem solving abilities in an 
environment which is more conducive to learning. Such games can be fun 
wh-ile teaching valuable skills. However, students can reach a plateau in 
their game playing and cease to try new strategies. Uhen this occurs, 
the learning process halts, A solution to this problem is to encourage 
the student to improve by occasionally offering suggestions regarding 
improved game strategies. Unfortunately, the cost of providing human 
teachers to watch the game (and offer suggestions) is prohibitive. 
However, it is possible to use a computer to monitor the progress of the 
game and to offer suggestions when warranted. With this goal in mind, a 
program was written to serve as just such an advisor for a computer game 
referred to as "Uumpus hunting" or "Uumpus" 

The relationship of the Uumpus Advisor to the player is illustrated 
in Figure 1.1. The Advisor analyzes the interaction betueen the game and 
the player, giving advice when appropriate. Uhen the Uumpus Advisor 


Th8 9 afl,e of Uumpus was originally described by Yob In Creative 
Computing, and thie particular variation was Implemented in LISP by Greg 
C1emenson. 
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chooses to give advice, it can intercept the player’s messages to the 
game and allou the player to modify his move if he so desires. Also, 
from the player’s point of view, the Uumpus Advisor receives exactly the 
same information #8 the player. The Uumpus Advisor can alter the game 
(shown in dotted lines) to a certain degree. It does so to create 
situations which are more conducive to learning. 


Figure 1.1 

Conceptual (lodeI of a Tutor. 



The game "Uumpus" is an example of a game which encourages 
deductive processes and develops a sense of probabilities. The player 
must seek out and kill a monster, the Uumpus. The player moves in the 
Uumpus’s warren, a network of interconnecting caves containing the Uumpus 
and other dangers, namely bats and pits. At the start of the game the 
player is told the number of caveB in the warren and the number of bats 
and pits. (i,) Before every move the player is told which cave he is in 
and the neighboring caves that he can move to. I f an y of the 

neighboring caves contain a bat he will be informed that he hears 


(ii) In a normal game, there are twenty caves, three pits, and three 
caves with bats. 

liiil Each cave is identified by number. 
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squeaking. Likewise, if a neighboring cave contains a pit, the player 
will be informed of a draft (as pits are bottomless chasms), but in 
neither case is the player told which of the caves are dangerous. 
Whenever the player is within two caves of the Uumpus he will smell its 
horrible stench. If the player enters a cave uith a pit he loses the 
game, whereas if he enters a cave containing a bat, he will be carried to 
a random cave which may contain another bat, a pit, or the Uumpus. The 
Uumpus eats unwary players who stumble into his lair. The player tries 
to visit enough caves (avoiding bats and pits) to locate the Uumpus 
without actually entering its cave. Once the player has found the 
Uumpus, he can shoot an arrow into the Uumpus* lair from a neighboring 
cave, killing the beast. If the player shoots an arrow into a cave and 
the Uumpus is not there, his arrow will ricochet through the warren at 
random for roughly four caves and may kill either the player or the 
Uumpus. In Figure 1.2 part of a possible warren is shown. 


Figure 1.2 





Circled caves have been visited. 

Upper Case: Given Facts 
Lower Case: Deductions 

Playing the game can involve simple deductions and risk 
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minimizations as well as more complex strategies and considerations. At 
the simplest level, the player can deduce that certain caves are 
absolutely safe by the absence of warnings. These caves should be 
explored before any others. At a higher level the player can perceive 
that certain caves are probably safe, i.e. less likely to contain dangers 
than other caves. These perceptions are based on the patterns of the 
warnings and require appl ication of probabil istic heuristics wh i ch are 
commonly used by knowledgeable persons. An advanced player can usually 
deduce the exact location of the Wumpus through a quite thorough 
application of logic, but most players develop a general idea of its 
location without completing all the required deductions (though they are 
wrong often enough to encourage them to improve their deductive powers). 
There are also unusual situations which require very advanced 
considerations of the risks involved in order to select the best move. 
The Wumpus is a game that can be enjoyed by the beginner as well as the 
advanced player. 

In the development of the AdviBor, two different generations of 
programs have been written. They are referred to as Wusor I (Wumpus 
Advisor I) and Wusor II. For the sake of perspective, the next chapter 
will discuss the development of Wusor I, and then the remainder of the 
paper will discuss Wusor II, the most recent Wumpus Advisor. 
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Th& Original Wumpus Advisor. Wusor l 
Chapter 2 



o 


Uusor I can be viewed as a continuation of the work Burton and 
Brown (1975) with the game of West. West is an algebraic game for which 
they developed an Advisor program. However, West was a relatively simple 
domain which was not conducive to more advanced Advisors. As a result, 
it was decided that Wunipus was a domain that was sufficiently difficult 
to allow the development of more advanced Advisors, though it was still 
simple enough to allow the development of a functional program. With 
this goal in mind, work was begun on Uusor I. 

Wusor I was written as a course project for nine graduate students 
at PI.I.T. with its development being supervised by Jim Stansfield and Ira 
Goldstein. As the Wumpus Advisor was required to discern the quality of 
the different moves as well as be able to explain itself when 
appropriate, it quickly became a very complex program. To keep everything 
manageable, the program was written in a modular fashion. This aided in 
the development of the Advisor as it allowed the modules to be written, 
debugged, and modified relatively independently. After its initial 
development, Wusor I was extensively modified by the author. 

Uusor I had three main modules and several sub-modules. The three 
main modules were the Executive, the Bat/Pit Locator, and the Wumpus 
Locator. Associated with the three main modules were various sub- 

(i) Uusor I is fully described in Stansfield, Carr, and Goldstein (197G). 

{n) The Bat/Pit Locator was written with the help of Will Clinger, John 
Avgoustis, and Fred Knoule. The Uumpue Locator was originally written by 
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Figure 2.1 

Ootted llnaa rapraaant significant flows of Information. 
Solid llnaa Illustrate the hierarchy of control. 
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modules which are illustrated in Figure 2.1. Each of the Locator modules 
had two sections, the Evaluator section which computed the merits of the 
various caves, and the Explainer section which explained the results of 
the Evaluator. Also, the Bat/Pit Locator had two levels, the level which 
considered bats and pits separately, and the Combination level which 
Combined these results. Finally, the Executive Evaluator combined the 
results from the Bat/Pit Evaluator and the Uumpus Evaluator to come up 
with an overall utility for each cave. Then the Executive had the option 
of calling upon the Explainer routines to generate explanations for the 
student. This structure is rather contorted and unnecessarily hard to 
understand, but this is primarily due to an incomplete understanding of 
the domain at the start of the project. However, from the work, on Uusor 
I it was possible to gain many insights which finally led to the 
development of Uusor II. 

The Bat/Pit Locator computed an estimate of the probability that 
any cave contained a bat or a pit, but the Uumpus Locator was not 
advanced enough to arrive at estimation of the risks from the Uumpus; it 
only determined which caves could not contain the Uumpus and which caves 
were likely to return information about the Uumpus’ location. (iiii The 
Executive Evaluator could not directly compare the risks involved from 
Bats/Pits and the Uumpus as it did not have any probability estimates for 
the Uumpus. Instead it broke down the different moves into the eight 
categories shown in Figure 2.2 and evaluated the player’s move according 

Neil Rowe, Beth Levin, and Robin Gross. The Route Planner was written by 
Ginny Grammar. 

(1 ' ' * The Uumpus Locator was sufficiently advanced to locate the Uumpus, 
but it could not deal with incomplete information. 
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to these categories. The different Evaluator routines taken together 
comprised the Uumpus ''Expert" {iv) which possessed a knowledge of the 
game. 


Figure 2.2 

Uusor I uses eight classifications for unvisited caves. 

Type 1. A cave that is absolutely safe and that gives information about 
the Uumpus. 

Type 2. A cave that is absolutely safe. 

Tupe 3. A cave that has no bats or pits, but that could contain the 
Uumpus. This cave inherently gives information about the Uumpus. 

Type 4. A cave that might contain bats or a pit, but not the Uumpus and 
which gives information about the Uumpus. 

Type 5. A cave that might contain bats or a pit, but not the Uumpus. 

Type 6. A cave which could contain bats, pits or the Uumpus. 

Type 7. A cave that contains a bat. This classification is a very 
special class that has varying value. 

Type 8. A cave that is certain death. (Either a pit or the Uumpus). 


Throughout the development of the Expert, heuristics were always 
used in preference to lengthy calculations. This uas partly because the 

exact calculations were so lengthy that it was unrealistic to compute 

them, but also because such heuristics were, in fact, used by humans 
players. ^ This correspondence between human and computer reasoning 

greatly simplified explanations. An example of this was the manner in 

which the Bat/Pit Locator determined the likelihood that a cave contained 


(iv) The concept of designing the tutor around an expert was developed by 
Brown (1973) in his Sophie program. 

In particular, the heuristics were those used toy the designers of 
Uusor I. Further experimentation is required to determine how common 
these heuristics are. 
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a bat or a pit. It might have been possible to compute the exact 
probability of a cave containing a bat or a pit, but this calculation 
would have been time consuming and complex. Worse yet, any explanation of 
an exact probability would have had to discuss the relevant permutations 
and combinations, which would almost certainly have been above the level 
of the student. 

Wusor I had only a very limited model of the user’s knowledge. It 
did not keep track of different players, but, instead, asked the user at 
the start of each session for an estimation of his ability. For the rest 
of the session the Expert would make all of its computations at a level 
of complexity hoped to be slightly above that of the student. This had 
the disadvantage that, if the player entered an unrealistic evaluation of 
his ability, he would either be overwhelmed by material well above his 
level, or he would find the Advisor’s advice over simplified. There was 
also the problem that Wusor I did not have fine gradations in the level 
of its advice (it had four levels) due to the expense of programming the 
Expert to function at more levels. 

Wusor I was tested on a very limited basis and was found to be 
reasonably effective at teaching better Wumpus strategies. However, it 
was mostly used by M.I.T. related personnel (including children and 
graduate students) who, we assume, already possessed a logical bent. In 
these cases it was effective, but it was not attempting to teach a better 
way to think about problems (as the players already possessed the desired 
thinking habits) but merely better ways to think about a particular 
problem. In those very few cases where it was advising students who did 
not already have the desired thinking habits, the handicaps of Wusor I 
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were readily apparent. It was not able to carefully guide the player 
through the more basic steps {its simplest level was not basic enough) 
and tended to confuse and frustrate the student. Because of these 
limitations, work was started on a more advanced Uumpus Advisor with a 
more detailed student model and the ability to tune its explanations to 
the particular student. 
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The Expert 
Chapter 3 

Overview of the Expert 

From the work which was done on the expert of Wusor I, it was 
possible to develop a much more powerful and general Expert. The 

deductions required in locating the dangerous caves and the heuristics 
necessary to arrive at an estimation of the danger in cases of 

uncertainty have been simplified to the point where there is a Locator 
Algorithm which requires only very general information (such as the 

number of caves, the number of bats, how far a bat warning propagates, 

etc.) to determine the relative dangers of each cave. The Locator 
Algorithm has three basic steps as shown in Figure 3.1. It i9 executed 
once for each danger being considered, and the results are then combined 
probabilistically. The first step i9 the acceptance and propagation of 
any and all information supplied by the Wumpus game. 

Figure 3.1 

The Three Steps of the Locator Algorithm 
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The Information Propagator. 

For each of the three dangers the Information Propagator marks each 
cave with three pieces of information, the maximum distance to the 
closest danger, the minimum distance to the closest danger, and, when the 
first two numbers eliminate all other possibilities, a marker for the 
exact distance to the closest danger. There are eight theorems which 
allow the Propagator to update this information about a given cave. The 
first two are that any time a cave is visited it can be marked as either 
more than zero away or exactly zero away. ^ Also, in the special case 

of the Uumpus, if the player shoots an arrow into a cave and does not 
kill the Uumpus, then that cave is more than zero away. * M * Next, if a 
cave is visited and has no warning, then that cave can be marked as more 
than N away, where N is the distance that a warning for the given danger 
travels. Likewise, if a warning is sensed, then the cave must be less 
than (N+l) away. The last theorems deal with the propagation of 
information through the warren. If a given cave is more than N away, 
then all of its neighbors must be more than (H-l) away. Conversely, if 
all of a cave’s neighbors are more than (N-l) away, then said cave can be 
classified as more than N away. The above theorems correspond to rules 
zero through seven shown in Figure 3.2. Using additional theorems, more 
information could be propagated, but these eight rules are sufficient to 


Except for bats, the classification of zero away attained in this 
fashion is rather pointless as it also signifies the end of the game. 

(ii> Note that in some implementations of Uumpus, the Uumpus moves when 
an arrow is shot (if it does not kill him). Uusor II does not permit 
this primarily because it would require degrading the data base. This 
could create situations which were too complicated to readily be 
explained to children. 
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mark each cave with all the information relevant to playing the game. 
Many caves can be classified as being safe just through the use of the 
more than N away tag. There are examples of the deductive process in 
Figures 3.3 and 3.4. 






Figure 3.2 

The Logical Rules of the Expert 

A cave can be marked as "zero away" if it was visited and found to 
contain a danger. 

A cave can be marked as "more than zero away" if it was safely 
visited. 


If the player shoots an arrow into a cave and does not kill the 
Wumpus, then that cave can be marked as "more than zero away" 
(Wumpus). 


L3 If a cave is visited and there is not a warning, then that cave is 

more than III away” where hi is the distance that the warning 

propagates . 

L4 If a cave is visited and there is a warning, then that cave is 

"less than (hl+1) away”. 

L5 If a cave is marked "more than hi away" then all of its neighbors 
must be "more than (hl-1) away". 

LB If all of a caves neighbors are "more than (N-l) away”, then it 
must be "more than II away". 

L7 If a cave is "more than (hl-1) away" and "less than (N+l) away", 

then it is "hi away". 


When the algorithm is creating cave-sets and it encounters a cave 
which would be hi caves away but which is also "more than hi away", 
then that cave can not have any contributions to the cave-set. 

If the player encountered a danger in a cave, then the cave does 
not contain a danger of higher priority, i.e. the Wumpus eats the 
player before he can fall into a pit, and he will fall into a pit 
before he is picked up by bats. 

Certain caves can be marked as "more than zero away" based on 
consideration of the different complete cave-sets and the number 
of dangers. 
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In Figures 3.3 and 3.4 there is a standard notation for diagrams of 
the uarren. The nodes of the network are numbered and represent caves. 
Circled numbers represent caves which have been visited by the player. 
To the top right of each visited cave is a marker for whether or not any 
warnings were sensed. ("U" indicates that a warning was sensed, and a 
"NU" means that a warning was not sensed) To the bottom left of each cave 
is any classification that the cave may have. (i.e. more than N, 7 ess 
than M, and J away) The reader is cautioned that, while the the diagrams 
of the warren in the examples are very regular (planar), normally warrens 
are much more random in their arrangement. 

In Figure 3.3, the danger being considered is pits. Hence, 
warnings propagate for a distance of one cave. In Figure 3.3A, the player 
has visited two caves, neither of which had any warnings. Therefore, the 
visited caves are more than one away and all the neighbors are more than 
zero away (i.e. safe). Upon visiting cave 3 (Figure 3.3B), the player 
receives a warning, indicating that cave 3 is less than two away. Since 
it is also more than zero away, it must be one away (i.e. cave 3 is next 
to a pit). 

In Figure 3.4 the danger is the Wumpus whose warning propagates for 
a distance of two caves. At the start (Figure 3.4A), the player has 
visited various caves and has discerned that caves 7 and 18 must contain 
pits. He has also visited two caves of relevance, caves 5 and 6. As no 
warnings of the Wumpus were sensed, caves 5 and 6 were marked as more 
than two away. All the neighboring caves were then marked as more than 


l ' i^ An interesting experiment is to determine whether the restriction 
to regular warrens increases the speed at which people learn the game. 
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Figure 3.3 


The Danger is Pits. 

U indicates a warning. 

NU indicates the absence of a warning. 
A warning propagates for one cave. 
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one away. Next the player visited cave 3 (Figure 3.46) and sensed a 
warning. This indicated that cave 3 was less than three away. As it was 
also wore than one away, cave 3 was marked as being exactly two caves 
from the Uumpus. Caves 2 and 4 were marked as more than zero away by 
dint of cave 3’s classification of more than one away. In Figures 3.4C 
and 3.40 the player visited caves 2 and 4 which he knew to be safe. At 
these caves he sensed a warning and marked the caves accordingly. 
Finally he visited cave 9 (Figure 3.4E) and could then conclude that the 
Uumpus must be in cave 1 (by a process to be described later). 

The special classification of N caves away is made because it is 
the only classification used extensively in the later steps of the 
Locator Algorithm. In particular, if one can determine all caves that 
are exactly N caves away from a cave which is classified N aivay then one 
has what is referred to as a complete cave-set. (A partial cave-set is 
one in which all the caves which are known to be N caves away are listed, 
but the set is not necessarily complete). In the case of bats and pits, 
the determination of the complete cave-set is especially simple since the 
only classification of relevance is one away and the complete cave-set is 
just the immediate neighbors. Cave-sets are important because at lea st 
one member of each complete cave-set must contain one of the given 
dangers. Therefore one can eliminate from the cave-set those caves which 
can not possibly contain the given danger for other reasons (as indicated 
in the data base). The result after this elimination is called a reduced 
cave-set. (Logical rule L8 in Figure 3.2 is the formal statement of the 
rule to reduce cave-sets). 

In Figures 3.3 and 3.4 there are several examples of cave-sets. In 
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Figure 3.3B the complete cave-set is caves G, 7, 8, and 17. However, as 
caves G and 7 are both more than zero away, the reduced complete cave-set 
is just caves 8 and 17. This means that either 8 or 17 must contain a 
pit. In Figure 3.4C the partial cave-set is caves 6, 7, 1, 10, and 3. 
As caves G, 7, and 10 are safe from the Uumpus, the reduced partial cave- 
set is caves 1 and 3. In Figure 3.4D there is an excellent example of 
the application of logical rule L8. The reduced cave-set is caves 1 and 
9, but this cave-set is also a complete cave-set as the neighbors of cave 
4 are known and, uhile the neighbors of cave 10 are not known, cave 10 is 
marked as more than one away and none of its neighbors can possibly 
contain the Uumpus. (Hence they can not be members of any reduced cave- 
set for the Uumpus). Uhen the player visits cave 3 (Figure 3.4E) he is 
taking a big chance, but once he has visited it he can eliminate it from 
the reduced complete cave-set (it is more than zero away). Then the 
player can conclude that the Uumpus must be in cave 1 as it is the only 
member of the reduced complete cave-set. In all future references, it 
will be assumed that a cave-set is actually a reduced cave-set, since the 
Locator Algorithm always reduces the cave-sets. 

The Possibility Evaluator 

The Possibility Evaluator marks certain caves as "more than zero 
away" by considering the different complete cave-sets and the number of 
dangers. It implements logical rule L10 shown in Figure 3.2. If the 
number of complete cave-sets exceeds the number of dangers, then it is 
likely that this step of the Algorithm will deduce that certain caves can 
not contain a danger. For example, if there are only three caves with the 
given danger, and there are complete cave-sets of: 

( 1 , 2 ) 
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(3,4,5) 

(6,7,8) 

(8,9), 

then one can deduce that caves 6, 7, and 9 must be safe. This means that 

cave 8 must contain one of the given dangers. The result after the 

application of the second step iss 

( 1 , 2 ) 

(3,4,5) 

( 8 ). 

An explanation for why cave 6 must be safe could be, "you can deduce that 
cave 6 must be safe as there are only three pits. You know that there 
must be a pit in either cave 1 or 2, just as you know that there must be 
a pit in one of the caves 3, 4, and 5. Likewise, there must be a pit 
either cave 8 or 9. This accounts tor all three pits and hence there can 
not be a pit in cave 6." 

An algorithm to determine which caves are safe in this case is to 
choose N complete cave-set9 that have no intersections (i.e. no 

members in common). Then any caves that are not a member of the union of 
the N cave-sets must, of necessity, be safe. The selection of the N 
complete cave-sets is carried out by exhaustively trying all 
possibiIities. 

Unfortunately, this algorithm does not find all possible caves that 
must be safe. For example, if there are only two dangers and there are 
cave-setss 

( 1 , 2 ) 

(2.3) 

(1.3) 

(1,3,4), 



s the total number of dangers of the type being considered. 
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then cave 4 can not possibly contain one of the given dangers. An 
algorithm to find all safe caves is to construct all combinations of N 
caves from the set of all caves that are members of any cave-set, 
eliminate combinations which do not fulfill the requirements (i.e. do not 
have at least one cave from every cave-set), and mark those caves uhich 
are not a member of any of the remaining combinations as safe. This 
algorithm is roughly as efficient as the first algorithm, but, 

unfortunately, it is sometimes not possible to explain the results of 
this last algorithm in simple terms. This is especially true in those 
unusual case 9 where the first algorithm failed. For example, the 
explanation for the example above would read, "Cave 4 can not contain a 
pit because there are only two pits. If a pit were in cave 4, there would 
only be one other pit to explain all the evidence. We know that there 
must be a pit in either cave 2 or cave 3, but if the remaining pit were 

in cave 2 there could not be a pit in one of caves 1 and 3 (and we know 

that there must be a pit in one of caves 1 and 3). Likewisej If the 
remaining pit were in cave 3 there could not be a pit in either of caves 

1 and 2 (and we know that there must be a pit in one of caves 1 and 2). 

Therefore there can not be a pit in cave 4." If this explanation seems 
rather obtuse, the reader is cautioned that the case explained above is 
the simplest possible case in which the first algorithm fails. Most 

other cases are significantly more complicated (and virtually 
unexplainable). (v) For this reason, the first algorithm was used instead 
of the second algorithm. 


(v) Interested readers are challenged to write an explanation in simple 
terms for why cave 11 can not contain a pit if there are three pits and 
cave-sets (1,7,3,11), (1,2,3), (3,4,5), (5,6,7), (7,8,9), and (9,10,1). 
The explanation should not contain any new terms like "cave-sets". 
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The third step of the Locator Algorithm estimates the probability 
that each cave contains a danger. It considers all the cave-sets, 
complete and partial, and is guided by five basic principles. The first 
principle (Pll of Figure 3.5) is that each member of a cave-set has an 
approximate probability of 1/N where N is the number of caves in the 
cave-set. In the case of partial cave-sets, N is an estimate of the size 
of the complete cave-set. For example, in Figure 3.4C the partial cave- 
set is caves 1 and 9, but N would be four as one would expect that the 
neighbors of cave 4 would add roughly two more caves to the cave-set. In 
any case, the probability of 1/N is not exact because it is often 
possible that some cave-sets will contain more than one dangerous cave. 
However, the estimate does give a relative measure of the risks involved 
with the different caves. 

The next general principle is that if a cave has been identified as 
definitely containing a danger, that cave will explain the warnings 
responsible for any cave-sets of which it is a member. (vi) This makes 
it less likely that the other members of these cave-sets will contain a 
danger. The third principle is that if a cave is a member of more than 
one cave-set, it is more likely that the cave does, in fact, contain a 
danger. Conversely, the other members of these cave-sets are less likely 
to contain dangers. Lastly, it is better to shoot into caves that are 
likely to contain the Uumpus before visiting them. This principle 
considers the fact that if the Uumpus is not in the cave, then the arrow 




Every cave-set with more than one member can be attributed to an 
originating cave at which a warning uas received. 
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Figure 3.5 

Pllj Equal Likelihood Principle set P»l/N where N i* the nu*«r of ci»i in the imiini 

cave-set of which sold cavo la a aeaber. If corresponds to the slapllfylng assuaption that each 
cavo-aat had exactly ona danger randomly aailgnad to It. 

P12i Redundant Evidence Principle When it la noted that a given cava-aat i* a superset of 
another cavo-aat, than tha auparaat cava-aat la no longer conaidarod aa the uarnlnga Involved are 
coaplately explained by the aubaet cava-aat. For the cavee regaining In the eupereet after the 
aubaet la removed, tha probability la eat tot 

u 

p=- 

T-I-S 

wherei I» tha nuaber of Identified dangera. (The number ol independent complete cava-aata). 

U« tha nuaber of unidentified dangera. (The total nuaber of dangera alnua I). 

3. tha total nuaber of aafa cavoa baaed on the preaent evidence. 

T» tha total nuaber of cavoa. 

This probability la also applicable to caves for which there la no evidence to estlaate the danger by. 


P13f Multiple Evidence Principle for those caves which are aeabera ol aore than one cave-set the 
probability is set to< 

P, = 1 . 1 X (l-i) X (1-i) X (1-jL) x ... 

whsrei N» the nuaber of caves In the given cave-set of which said cave is a aeaber. 

This expression corresponds to tha slaplIfying asauaption that axactly one danger is randoaly 
assigned to each cave-set. It is P (danger) -1-P (no danger) where 
P (no danger) “P (no danger froa first cave-set) *P (no danger froa second cave-set) ♦, . . 


P14{ Adjust For Multiple Evidence Principle for caves which are aeabera of cave-sets to 
which the above loraula Is applied, the probabilities are adjusted tot 


fp t = 


1 M-] j^ P lA'N 1 ^ + ^ P 1 B~ N-j ) + ' (N 1 -l) + ^ P 2A"N a ^ + ’ ” ^ 


HVTT 


wherei FP. the final probability for the specified cave. 

N> the nuaber of caves In the specified cave-set. 

P« the probability Initially calculated (above) for the specified cave, 
da the nuaber of cave-sets of which the spacifled cava Is a aeaber. 

Nuaerlc subscripts are for tha various cave-sets of which the Ith cave is a aeaber. 

Letter subscripts associated with nuaerlc subscripts are for the various aaabers of the specified 
cave-set, oalttlng the Ith cave. 

This expression averages the changes froa PI3 out aaongst the other aaabers. It has no theoretical 
Justification, but it works very nicely. 


P15j Shooting Principle Whenever P^,^,, exceeds P Cut 0{| It Is reset toi 


FP, 


(1 ” p w ) x 3 


Cut Off 


^ " P Cut Off j x 3 * P Cut Off • 25 
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Mill start ricocheting with roughly a one third chance of killing the 
player. It is applied only if there are sufficient arrows so that the 
loss of an arrow will not end the game. These five principles are stated 
as rules with their accompanying formulas in Figure 3.5. The formulas 
are not referred to in any explanations but are simply used internal ly to 
allow the Advisor to evaluate the relative merits of the different moves 
more precisely. 

Combination of the Results 

To complete the comparisons between the different moves, the Expert 
cal Is the Locator Algorithm three times, once for each respective danger. 
Then it combines the probability estimates to come up with an overall 
estimation of the value of visiting each cave. The Expert is guided by 
eight basic principles in computing the overall evaluations. They are: 

C0: Safe unvisited caves are preferable to all others. 

Cl: Given that it is necessary to take some risks, those moves with 
the lower risks for all dangers are preferable. 

C2: Risking multiple dangers is worse than risking a single danger, 
presuming there is a common danger and that the common danger is equally 
likely in both cases. 

C3: Presuming that there are common dangers that are equal ly 
likely, it is preferable to visit those caves where the unequal risks are 
lower. 

C4: Bats are not as dangerous as pits or the Uumpus. 

C5: Locating the Uumpus is an important goal, and it is worthwhile 
to visit caves which are likely to give information about the Uumpus’ 
location. 

CS: Uhen the player is in a very dangerous section of the warren 
with no known route to a safer area, it is sometimes advisable to seek 
out bats in the hope that they will carry the player to a safer section 
of the warren. 

C7: Uhen the location of the Uumpus is probably knoun, it is best 
to shoot into the cave. 
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Figure 3.G 

The Combination Principles were converted into the following formulas: 


COST- 1 - < 1 -C B .,) * (1 -C p ,, > * (l-C Uu „ oul ) 


(C0 thru C3) 


C =P 

Uumpus Uumpus 


r =P 

'-Pit r p i t 


E +N 

& PIT ^Uumpus 


^Bai^Bat*" 


N -E 

'Cava* '"Bat 


GAIN= V Cava+ V Bat 


(C4) 


V =B*(1-P )(1-P )(1-P } 

Cave D * vi r Bat' ' x r Pit' lx ‘Uumpus' 


(C5) 


j _w -E -N 

’Cave* ’Visited '"Pit ^Uumpus 


^Bat = ^Bat* ^Uumpus^ *' 


N -E 

'Caves ‘"Bat 


<C6> 


where: P* The probability for the specified danger for the cave being considered. 

B« fl measure of the value of the information to be gained at that cave, (normally one). 

It is increased if the cava is likely to give information about the Uumpus. 

N« The number of caves of the type specified. 

E* fin estimation of the number of caves with the specified danger that do not contain 
another danger of a higher priority. The priorities from highest to lowest are Uumpus, pits, 
and bats. For example, 

^Pit ^Uumpus 

- ><1 -> 

^Caves ^Caves 

The above formulas can be used to give an index of: 

GAIN 

INDEX= - 

COST 


The caves can then be ordered according to the INDEX. If C0ST»8, the INDEX would go to infinity; 
there would have to be different sizes of infinity. Instead, if the COST*0, the INDEX is set to 
very large number times the GAIN. 
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These principles were incorporated into the formulas shown in Figure 3.G 
and are used to compute the index for each cave. This index is an 
estimation of the value of visiting each cave. 

Sumary 

It is uorth noting that the Expert described herein does not always 
find the best move possible. In several cases it was decided to use less 
accurate algorithms in order to make the computations of the Expert more 
understandable to humans. An example of this at a global level is the 
manner in which cave-sets are implemented. In Figure 3.7 there is a 
diagram of a section of the warren. The actual Locator Algorithm would 
conclude that caves E, E, and 7 are all equally likely to contain the 
Uumpus. In fact it is more likely that cave G contains the Uumpus. as a 
Uumpus in cave S or cave 7 does not explain all the warnings directly. 
{vil * An improved algorithm could apply probability rules throughout the 
propagation of cave-sets. A simplistic approach to this would be to 
assign to cave 3 and cave 4 a probability of one half of being next to 
the Uumpus. Iviii) j n ^his manner one achieves the results shown beneath 
Figure 3.7. These results are more realistic, but the improved algorithm 
has the unfortunate drawback of destroying the precise division between 
the probability rules and the logical rules. It also makes it much more 
difficult to break the rules into bite-sized chunks which are easy to 
teach. For such reasons, the improved Expert does not carry out its 
computations in the most accurate manner possible. 

required between cave 5 and 7 to 
approaches suggested in Stansfield, 


An additional connection is 
explain all of the warnings. 

(viii) This is one of the general 
Carr, and Goldstein (197S). 
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5 


6 


7 

Probabilities For Uumpus 


Cave 

Old Algorithm 

Neu A1gorithm 


Probabi1ity 

Probabi1ity 

5 

.3333 

.25 

6 

.3333 

.50 

7 

.3333 

.25 


The sixteen danger specific rules (the eleven logical rules in 
Figure 3.2, and the five probability rules in Figure 3.5) together with 
the eight combination rules are a concise definition of the Uumpus 
Expert, flany of the intricacies of the expert, such as the priorities of 
the different probability rules when there are conflicts, have been 
omitted. Such details were worked out in the actual program, but are not 
worthy of note in this paper. The different rules have been presented in 
(and are numbered in) order of increasing complexity and, to a certain 
degree, in order of logical dependency. Normally, each rule uses only 
the input data and the results of lower numbered rules. The 24 rules 
taken together comprise the knowledge which the Uumpus Advisor endeavors 
to teach. In this sense, the entire Uumpus Advisor centers around the 
Expert. 


Figure 3.7 
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The Improved Wumpus Advisor, Wusor ll 
Chapter 4 

The Overall Structure 

Uusor II has been implemented with the structure shown in Figure 
4.1. The Uumpus Expert module is an implementation of the expert just 
described. The Move Comparer analyzes the different moves noting the 
knowledge required to justify their relative merits. The Psychologist 
uses this information to determine how the player’s actions (moves) 
reflect upon his knowledge of the game and informs the Student Model as 
such. The Student Model can then decide what the player is presumed to 
know, and, from this, it determines what material is at the student’s 
leve l« The Uumpus Advisor uses this information (added to the 

results of the Move Comparer) to determine what to explain to the 
student. (ii) If it decides to give advice to the student, it will call 
upon the English routines to explain certain of the results of the 
Expert. The English routines use the Student Model to prune their 
explanations in light of the player’s current knowledge. The Uumpus 
Executive coordinates the activities of the Uumpus Advisor and Uumpus 
game and implements the conceptual model of a tutor shown in Figure 1.1. 

(l) Note that within this paper, "Student Model" will be used to refer to 
more than just the data about the student, but also the routines which 
maintain and evaluate the data. In effect, the Student Model is a "black 
box" which answers queries about the student's current state, and we will 
not be concerned about the details of its functioning until later. 

Burton and Brown (1975) describes a previous Advisor program for the 
game of Uest which was built around an Expert and which utilized a 
primitive Student Model. 
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As the Uumpus Expert performs its calculations, it keeps a record 
of the processes it went through to arrive at its results. The 
information relevant to each cave is kept on the cave’s property list so 
that the information is easily accessible. Tags are also put on the 
property lists which correspond to the unique numbers assigned to each of 
the rules listed in Figure 3.2 and Figure 3.5. In this manner the Expert 

provides a complete set of sub-conclusions and sub-results along with the 

i 

final results. These property lists are used extensively by the English 
Generator and Move Comparer. Both of these modules contain a structure 
modelling that of the Expert. 

Figure 4.1 
Diagram of Wusor II 
Arrows Represent Data 
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/ > 



/ / 
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The Uumpus Advisor also has the ability to modify the Uumpus game 
to create situations more conducive to learning. This has been done 
primarily by allowing the Uumpus Advisor to define the number of bats, 
pits, Uumpii, and caves appropriately as well as choosing the initial 
caves encountered by the student. The starting cave is selected by the 
Advisor, and the player’s first move is transposed to the desired cave 
(all of the choices for the first move are isomorphic). This prevents 
the player from losing on the first move before he has sufficient 
information to make intelligent decisions and also allows the Uumpus 
Advisor to challenge the student with simple idealized problems that are 
appropriate for a student of his level. Examples of problems selected 
for students of differing levels are shown in the Appendices. As a 
further addition, the Uumpus Advisor could be given the ability to 
restructure the warren as the game progresses. This modification is non¬ 
trivial since these changes must be transparent to the player, but, more 
importantly, the Uumpus Advisor must be able to fully describe the 
situation it is trying to create. The current Uumpus Advisor does not 
have sufficient information available to it to allow it to fully specify 
more complicated situations, but it could attempt to move dangers so 
that, if the student were playing uisely, he would gain positive 
reinforcement. Conversely, if the player were playing unwisely, negative 
reinforcement would be supplied. 

In the remainder of this chapter we will discuss the curriculum of 
the Uumpus Advisor. In the next chapter, we will fully describe the 
Student Mode I and its relationship to the Move Comparer and the 
Psychologist. Then in Chapter S we will discuss the tutoring strategies 


Page 31 




The Wmpus Advisor 


Uusor II 


which have been selected for Uusor II, In Chapter 7 we will describe the 
English Generator before we discuss (in Chapter 8) the experimentation 
which is appropriate for the Uumpus Advisor. 


Page 32 




The Wumpus Advisor 


Wusor II 




' ' ’ "i ' . ■> ■ r '< r'- ‘ ► ■. ’C • -' " - ’ 1 ^ f 11 

The Curriculum 

There is a definite hierarchy for the different rules used by the 
Expert, which makes it logical to have a corresponding hierarchy in the 
curriculum of the Wumpus Advisor. It is not realistic to explain the 
different probability rules until the student has mastered some of the 
logical rules used in the derivation of the cave-sets because the 
probability rules are so dependent on these cave-sets. Because of such 
dependencies, the curriculum of the Wumpus Advisor has been broken down 
into five phases. In each phase, the Wumpus Advisor attempts to 

teach certain rules, and the student will not be advanced to the next 
phase unti I he has mastered those rules which are the foundations for the 
next phase. In the zeroth phase, the student is taught the basic rules 
of the game. In particular, the player is taught that he is allowed to 
backtrack, and that, if a cave neighbors on a cave in which there 

were no warnings, that cave is necesssri ly safe. In phases one through 
four, the player is allowed to advance separately with respect to each 
danger and each danger has its own curriculum. This is necessary because 
the complexity of the different rules varies widely according to the 
distance through which warnings propagate. For this reason, there are 
two basic curriculums: a curriculum for when warnings propagate one cave 
and a curriculum for when warnings propagate further. 


®IP» (Barr, Beard, and Atkinson, 1975) a CAI program for teaching 
elementary programming skills, has considered the issue of the conceptual 
dependencies between the skills which are taught. BIP takes the more 
general approach of formally stating these dependencies in an Information 
Network. This task is performed implicitly in the phases of the Wumpus 
Advisor, along with various other functions. 

*' v) Knowledge of backtracking is essential for good play, and some new 
players do not grasp it on their own. 
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The curriculums for bats and pits presumes a warning distance of 
one cave. In the first phase of this curriculum, the player is taught 
how to perceive which caves are safe from the absence of a warning. He 
must master these rules before he can advance to phase two where he is 
taught the first probability rule (Pll). The player must master this 
probability rule and its supporting logical rules before he can advance 
to phase three. In phase three, the player is taught the reasonably 
complicated probability rules (P12 and P13) and must master both of them 
before he can advance to the last phase. In the last phase, the player 
is taught the most advanced of the probability rules (P14). This rule is 
left to the last as it has a conceptual dependence on the two preceding 
rules. All of the rules taught in each phase are shown in Figure 4.2 
along with the necessary conditions for advancement to the next phase, 
(v) 


In Figure 4.2, there is a new rule, T19, in the first phase of the 
Uumpus curriculum. This rule is a specialization of the logical rule LS. 
It is relevant whenever a "More Than" classification involves N 
applications of rule L5 where N is the warning distance. It is necessary 
in cases where the warning distance is greater than one because in these 
cases students will sometimes not apply LB to the limit. 
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Uusor II 
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The Uumpus curriculum presumes a warning distance of mor 

cave. In the first phase of the Uumpus curriculum, the player 

the logical rules necessary for perceiving which caves are saf 


Uumpus. This phase has the same rules as in the bat/pit curriculum, but 
because of the increased warning distance, the rules are more complicated 
to apply. In the second phase, the Uumpu9 Advisor teaches the player 
those rules necessary to arrive at cave-sets. This separation of the 
rules for deriving cave-sets from the first probability rule is made 
possible because of the special value assigned to caves which are likely 
to give information about the Uumpus. The Advisor can speak of the value 
of completing cave-sets without actually getting involved with the 
details of probability. The separation of the logical rules used in the 
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derivation of cave-sets from the probability rules is necessary because 
of the extra complexity of a warning distance of two caves. This not 
only makes the logical rules more complicated, but also makes the 
probability rule Pll more complicated as it now involves estimates for 
the size of the cave-set. In the third phase, the player is taught Pll 
and P15. He must have mastered Pll before he can go on to the fourth and 
final phase. In the last phase the Uumpus Advisor attempts to teach the 
P12 and P13. This is not possible in the standard game as there is only 
one Uumpus and there are no situations to demonstrate these rules. As a 
result, the Uumpus Advisor changes the game so that there are two Uumpii 
in the warren as soon as the student advances to the last phase. 

The player is allowed to advance independently through the last 
four phases of each danger. This allows for the possibility that the 
player might learn the use of a rule with respect to one danger, but 
might not recognize that it also applies to another danger. However, 
once the player has mastered a few of the logical rules for bats and 
pits, it can be expected that he will recognize their correspondence for 
other rules. Ivii) jr or ^ h j s reason, there ie a function in the Student 
Learning Mode I which notes when the player has learned several 
corresponding rules for bats and pitsj thereafter, it presumes that the 
player will recognize the correspondence between any other rules for bats 

(vi) Note that since there is only one Uumpus for the earlier phases, the 
logical rule which considers the number of dangers (L10) becomes much 
simpler and is taught in the same phase with Pll. 

^ v ' 1 ) This statement is based upon the players which the author has 
observed. The author has observed fifteen or so students in their 
interactions with Uusor! in all cases the students quickly learned the 
correspondence between applicable bat and pit rules. However, this issue 
warrants further experimentation. 
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and pits and adjusts the Familiarity Values accordingly. The player is 
taught the combination rules (rules C0 thru C7) independently of the 
different phases. However, in teaching the Combination rules, the Uumpus 
Advisor takes into consideration the conceptual dependencies of the 
ruIes. 

The curriculum that is taught by the Uumpus Advisor is designed to 
first teach the student the basic rules and then the more complicated 
applications of these rules. The total knowledge is broken down into 
specific items, and these items are grouped into phases so that each 
phase has applications that can be taught directly. This allows the 
Uumpus Advisor to make short but meaningful explanations to the student 
at a I I times. 
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The Psychologist and Student Model 
Chapter 5 

An Overview of the Move Comparer, Psychologist, and Student Model 
The Student Model along with the Move Comparer and Psychologist 
have the purpose of determining just what moves can be explained to the 
student and of determining what rules can be pruned from the explanations 
to be given. These two goals require that the Student Model be able to 
determine which rules are known to the Btudent. If a human tutor were 
faced with such a problem, his Student Model would be based on several 
sources of information. This sort of generalized Student Model is shown 
in Figure 5.1. The decision of whether or not the student knows a 
particular rule is based on the following inputs. 

1. How often the student seems to have applied the rule. 

2. How often the student has had the rule explained to him. 

3. The student’s knowledge of other rules (upon which this rule 
depends). 

Consideration must also be given to other important points. They are: 

4. How quickly the student learns. 

5. The student’s initial knowledge of the rules. 

S. The time period that has elapsed and how forgetful the student 
i s. 

Of course, these are not the only things considered by a human tutor but 
they are certainly very important considerations. Our Student Model wi I I 
take all of these inputs into consideration. 
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Figure 5.1 

Generalized Student Model 


Apparent Application 
Of Rules 


Explanations Given 



Q Knowledge of 

Prerequisite Rules 

Knowledge of 


Expert's Rules 


In Figure 5.2 there is a diagram of the data flow between the Move 
Comparer, Psychologist, and Student Model as well as the various sub- 
modules of the Student Model. The Move Comparer takes any two moves and 
determines which rules are necessary to justify their relative merits 
with respect to a single danger. (l) It also queries the Student Model to 
determine whether or not these rules are known to the student, acceptable 
for teaching, etc.. The Psychologist uses the results of the Move 
Comparer according to the Combination Rules to determine if the student 
has demonstrated a working knowledge of any danger-specific rules while 
the Uumpus Advisor uses them according to the Combination Rules to 
determine which of the Expert’s results it will explain to the user. The 
loop in Figure 5.1 representing the student's knowledge of other rules is 
implemented through the loop from the Learning Model to the Move Comparer 
to the Psychologist to the Knowledge Model and back to the Learning 
Model. The Student Knowledge Model maintains Familiarity Values which 
represent the student’s familiarity with any given rule. The Student 
Learning Model uses its estimate of the student’s learning ability to 

i'*. In Appendix C there is a complete description of how the Move 
Comparer functions. 


Page 39 











The bJumpus Advisor 


The Student Model 


interpret the Familiarity Values and determine which rules the player is 
presumed to Know. With these rules, the Student Learning Model can 

implement the curriculum of the Uumpus Advisor by computing which rules 
are acceptable for teaching. Finally, the Student Model Critic analyzes 
the interactions between the Psychologist and the Learning Model and 
adjusts the Learning Model as appropriate. ^ 1 '' 

Figure 5.2 

Arrows Represent Data 



<‘i) In this chapter there will be many references to the student’s 
knowledge of the different rules. This will normally refer to the 
Student Learning Model’s estimate of the student's knowledge. 

(iii) The Student Model also maintains disc files on all of the users. 

These files are keyed on the student’s first and last name; any time a 

new player uses the Uumpus Advisor, a new file is created. It is copied 
onto disc at the end of each session, and his file is reloaded at the 

start of a new session. The fi le contains the variables and arrays of 

the Student Knowledge and Learning Models as well as various other 
information. 
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The Psychologist and Combination Rules 

ihe Psychologist begins its analysis when the playen asks to visit 
a cave in the fringe area (iv) , and it assumes that, if the player has 
moved to a cave which is distant from the starting cave, he must have 
discerned that something was wrong with those moves which were closer to 
the starting cave. (v) The assumption that the player recognized a fault 
with each of the worse moves is qualified by considering the route which 
the player took to his move. If the route taken was non-optima I, the 
Psychologist will consider only those caves which were very close to the 
starting cave. If the player takes a particularly lengthy route, the 
Psychologist will consider only the immediate neighbors of the starting 
cave. (vii) Likewise, if the Wumpu9 Advisor has recently advised the 
player to go to the move which he finally selected, the Psychologist will 
not consider his move as it can not be certain just how much of the 


( . ,V) The fhinge area contains those caves which the player has not 
visited and which are accessible to the player through caves which he has 
a I ready visi ted. 


The starting cave is the last fringe cave which the player 
the last cave at which the player shot an arrow or received 
the Ulumpus Advisor. 


visited or 
advice front 


(vi) The Ulumpus Advisor has several features which are intended to aid 
the player during the game. One of these is a route planner which will 
explain an optimal route to any cave that is accessible through visited 
caves. If the player uses this feature to find a route to a cave and 
then goes there directly, the Psychologist will presume that the player 
recognized some fault in a77 caves which were worse than the selected 
move. 

Random chance becomes very likely when the player takes an 
extremely indirect route. Therefore, the Psychologist only considers 
caves that are within N caves of the starting cave where N-20-L-1, L is 
the length of the route actually taken by the player, 0 is the length of 
the optimal route, and N has a minimum value of one. 
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advice the player actually understood. IVI u 

Presuming that there were worse moves for the Psychologist to 
consider, it would determine if the player has demonstrated a working 
knowledge of any rules. The Psychologist would conclude that the player 
has demonstrated a working knowledge of a group of rules with respect to 
a certain danger only if there were no other plausible explanation for 
why the player's move was the better move. These conclusions are based 
on a restrictive definition of the first four Combination Rules. There 
are examples of this in Figure 5,3. In Figure 5.3A, the last fringe cave 
visited by the player was cave 5. The player then moved directly to cave 
12. Cave G is a worse move because it risks pits and the Uumpus. 
However, in this case the Psychologist would do nothing as it is not 
clear whether the player perceived the danger from pits and/or the danger 
from the Uumpus. The player also bypassed cave 3, a cave that was unsafe 
for bats, but, in this case, the Psychologist would presume that the 
player had perceived that cave 3 was unsafe (the default) and that cave 
12 was safe from bats. It would note that the player had demonstrated a 
working knowledge of those rules which made cave 12 safe from bats. 
(Logical Rules L3 and L5 for bats and Combination Rule C0). In Figure 
5.3B, the player started at cave 18 and chose the better move, cave 14, 
bypassing cave 17. In this case, there was a common danger from bats and 
a lesser risk from pits. The Psychologist would assume that the player 
saw that there was a smaller risk from pits in cave 14 because there was 
absolutely no difference in bats for the player to perceive. It would 


^ Vl 1 ' ^ Correspondingly, the Psychologist will not consider worse moves 
which the Advisor had previously informed the player were bad. 
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note that the player had demonstrated a working knowledge of the rules 
necessary to explain both caves’ probabilities for pits. (Logical Rules 
LI. L4, and L7 for pits, Probability Rules Pll and P12 for pits, and 
Combination Rule C3). 


Figure S.3 




The Uumpus Advisor also implements ths different Combination Rules, 
but it uses a less restrictive definition of the Combination Rules. 
While the Uumpus Advisor will only explain why a move is better because 
of a single danger, it is acceptable for the move to be better for 
reasons other than those explained. If the player asked to move to cave 
fa in Figure 5.3A, the Uumpus Advisor would advise him to move to cave 12 
instead because of a smaller risk from the Uumpus. The Uumpus Advisor 
f**\ uould only cursorily mention the smaller danger from pits if it mentioned 

pits at all because it tries to keep its explanations short. 
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Sometimes a player will make a lucky move which gives the 

appearance of a mastery of certain rules but which does not really 
indicate a knowledge of said rules. To restrict this possibility, the 
Psychologist first checks to insure that the rules in question are not 
well above the expected level of the student. If they are, the 

Psychologist will consider his move a lucky move. As a further 
precaution, the Psychologist compares the move selected by the player 

with all the better moves available to him. If there are any moves which 
are better than the selected move and which the user should have known 
were better, the Psychologist will degrade the Student Knowledge Model 
appropriately. The Psychologist determines if the student should have 
recognized the better move by insuring that the move in question was 
better than or exactly the same as the selected move in every way. It 
then insures that the student was thought to know all those rules which 
indicated the quality of the better move, and, if so, it will degrade the 
appropriate rules of the Student Knowledge Model. In some cases, this 
degrading of the Student Knowledge Model will cause the Student Learning 

Model to decide that the player does not actually know the rule(s) in 

question, and it will move the player to as low a phase as necessary. 


^ Rules are considered to be above the expected level of the student if 
they are not rules which are considered acceptable for teaching. 
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The Student Learning Model 

Just as the Student Knowledge Model can be viewed as an overlay of 
the rules (and knowledge) of the Expert, the Student Learning Model is an 
overlay of those tutoring capabilities possessed by the Advisor. It is 
composed of those abilities which are deemed appropriate for each 
student. (><l} In Goldstein’s Coach proposal (1977), an Advisor is 
envisioned capable of different explanatory strategies so that the 
Learning Model contains those explanatory strategies which are 
appropriate for the particular student. However, the Uumpus Advisor does 
not have this degree of freedom (as it has only one explanatory 
strategy) ^, and the Learning Model is primarily concerned with a 
vector which represents each student’s learning ability in the three 
dimensional Learning Space. The first dimension of the Learning Space is 
how long it takes the student to learn something (the Repetition Factor) 
and the next dimension is how long it takes him to forget something (the 
Forgetfulness Factor). The Receptivity Factor is the last dimension and 
indicates how often the student likes to be spoken to. How often the 
Advisor gives the student advice depends on the student’s Receptivity 
Factor. There is a large region within the three dimensional Learning 
Space in which the Uumpus Advisor can function effectively. 

The first dimension of the student’s learning ability (the 
Repetition Factor) is a variable which determines how high a Familiarity 
Value the student must have before it can be presumed that he knows a 


Overlay modelling is discussed more fully in 
(1977). 

(xii) Later versions of the Uumpus Advisor wi 
explanatory abilities with a correspondingly increased 
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rule. The Familiarity Value for a rule ie incremented for one of two 
reasons. If the player is not deemed to have heard of the rule in 
question, then the Familiarity \lalue will be incremented when the player 
is told the rule in question. ' 1 ^ Likewise, the Familiarity Value will 
be incremented if the student demonstrates a working knowledge of the 
rule (as determined by the Psychologist). (xiv) Uhenever the 
Familiarity Value exceeds the Repetition Factor, the Student Learning 
Model will presume that the student knows the rule in question. 

Uhen it notes that the student has learned a rule, it also checks to see 
if he has learned the rules requisite to advancing to the next phase. If 
60 , the Student Learning Model will note that it is acceptable to teach 
the additional rules of the new phase. (* V| ) 

The second dimension of the student's learning ability is the 
Forgetfulness Factor. The Student Learning Model normally presumes that 
the student has forgotten certain of the more recently acquired rules 
between sessions. This is done by decrementing the Familiarity Values 
according to the Forgetfulness Factor and how long the student has been 


(xiii) Whether or not the player is deemed to have heard of the rule in 
question is decided by comparing the number of times he has been told the 
rule with a variable that is dependent on his Repetition Factor. Note 
that this requires the Student Knowledge Model to keep track of how often 
the player has been told a rule as well as his familiarity. 

< x ' v) Familiarity Values are a rather primitive implementation of the 
Model in Figure 5.1 as the two external inputs are combined linearly. 

(xv) It is worth noting that the test value for having heard of rules is 
such that it is not possible for the Student Learning Model to presume 
that the player knows a rule without the player having demonstrated a 
working knowledge of the rule at some time. 

(xvi) E ven after the student is presumed to have mastered a rule, the 
Familiarity Value is updated whenever the student demonstrates a working 
knowledge of the rule. 
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away from the Uumpus Advisor. ( * VI|) The Familiarity Values are 
decremented by an amount equal to the log of the number of days that the 
player has been away from the Uumpus Advisor (plus one). The base of the 
log is the inverse of the Forgetfulness Factor. 

The Student Model Critic 

The Student Model Critic analyzes the player’s behavior whenever 
the player makes a move which indicates a working knowledge of a rule 
which he is not presumed to know, and when he makes a move which he 
should have recognized as better (these situations are identified by the 
Psychologist, of course). In such cases the Critic will often adjust the 
student s Repetition Factor or Forgetfulness Factor, whichever is 
appropriate. Also, if the player follows the advice of the Uumpus 
Advisor, the Critic will increase his Receptivity Factor. In contrast, 
if the player ignores the advice of the Advisor and makes moves which the 
Advisor has noted as bad, the Critic will decrease the player’s 
Receptivity Factor, (xviii) These small incremental changes to the key 
variables of the Student Learning Model are intended to adjust the 
Learning Model to the actual student’s learning ability. 

The Student Model Critic makes an analysis of the situation before 
making any changes to the Forgetfulness Factor or Repetition Factor. Its 
analysis is guided by the six Critic rules shown in Figure S.4. The 
Critic first notes whether or not the player '19 a new player; in such 


The values representing how often the player has been told each 
rule are also decremented as his memory of the advice given is presumed 
to have decreased over the given time period. 

(xviii) The amount uhich the Critic increases or decreases the student's 
receptivity depends on how recently the advice was given. 
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cases, it is quite likely that the initial estimate of the student’s 
knowledge was incorrect, and so the Critic will only make immediate 

adjustments to the Knowledge Mode I. This is formally stated in the first 
two Critic rules of Figure 5.4. The Critic will presume that it has an 

acceptable approximation in the Knowledge Model when it has to undo a 

decision made earlier. For example, if the Critic repeatedly decreased 
its estimation of the student’s knowledge, it would stop adjusting the 
Knowledge Model as soon as the player learned a rule which he had 
previously demonstrated that he did not know.* XIX ^ This is formally 

stated as the third Critic rule. Once the Critic feels that the 

Knowledge Model has been adequately adjusted, it will begin adjusting the 
Repetition Factor and Forgetfulness Factor, Houever, the Advisor will not 
make any changes to the Factors if the student’s Familiarity \Jalue for a 
rule in question is not relatively close to the Repetition Factor. If it 
is not fairly close, it is far more likely that this case is not 

indicative of the student's learning ability, but, instead, a fluke (a 
mistake on the part of the player or a lucky move). ^ xx ^ 

Once it is determined that the move in question involves a rule 

with a marginal Familiarity Value, the Critic will make the adjustment 
which is indicated by the last two Critic rules. 


(xix) B ec ause game situations often allow the student to more quickly 
demonstrate a knowledge (or the lack thereof) of rules with respect to 
certain dangers, the Critic notes whether or not the Knowledge Model has 
been adjusted with respect to each danger. 

* xx ^ The Critic will not make any changes to the Learning Model if the 
Familiarity Value was set in the initialization period. 
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Figure 5.4 

C/?i If a new player demonstrates a working knowledge of a rule which 
he was not initially presumed to know, adjust the Familiarity 
Value to indicate a knowledge of said rule. 

CR2 If a new player indicates that he does not know a rule which he 

was _ initial Iy presumed to know, adjust the Familiarity Value to 

indicate that he does not really know said rule. 

CR3 Cease adjusting the initializations of the Familiarity Values as 

soon as some decision made according to CR1 or CR2 is 

contradicted. 

CR4 If the player demonstrates a working knowledge of a rule which he 
was not presumed to know, then 

A Decrease the player’s ForgetFulness Factor if this was a 
rule which he was presumed to have forgotten recently. 

B Otherwise, decrease the player’s Repetition Factor. 

CR5 If the player’e actions indicate that he does not really know a 
rule which he was presumed to know, then 

A Increase the Forgetfulness Factor if this was a rule 
which the player was presumed to have learned in a 
previous session. 

B Otherwise, increase the player's Repetition Factor. 


. iviv .i vii rdufiis!!!:':;;' 'test 


In this manner, the Student Model Critic adjusts the values of the 
Student Learning Model as appropriate.^ xx ‘* 

The Student Model Critic also identifies those situations in which 
the limits of the Uumpus Advisor are exceeded and a human teacher is 
necessary. This is done by noting when the player does not seem to be 
advancing after an extended period and, more importantly, by noting those 
situations in which the limitations of the Uumpus Advisor are exceeded. 
For example, within a two dimensional Learning Space composed of the 
Repetition Factor and the Forgetfulness Factor, the Uumpus Advisor is 


XXI At this point it should have become clear to the reader 
Student Model keeps track of when a player is presumed 
I earned/un I earned each rule, which rules uere forgotten, etc.. 


that the 
to have 
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competent at advising students within the area shown graphically in 
Figure 5.5. The Student Learning Model can represent a user as a vector 
anywhere within the marked area. ( * xn) Any time the student’s Learning 
Vector falls outside of this region, the Student Model Critic calls for 
human help as the limits of the Uumpus Advisor will have been exceeded. 

Flgur# 5.5 



Time Before Forgets 

Because of the limitation on the Repetition Factor, the Uumpus 
Advisor can not always keep up with extreme I y quick students, but the 
Advisor is not really necessary in such cases. However, the limitation of 
the Forgetfulness Factor may be very significant, as the Uumpus Advisor 
can not deal effectively with students who are extremely forgetful. The 
limitation is caused by the fact that the Uumpus Advisor keeps track of 
time in days, which could be too large a unit of measure in extreme 
cases. Further testing is required to determine if this is an acceptable 
limitation. Otherwise, it would be necessary to modify the Uumpus 
Advisor to keep track of time in hours or, possibly, minutes. 

(xxii) j he rea der should note that the Uumpus Advisor is not able to 
finely tune its advice for players whose vectors are close to the limits 
of the Uumpus Advisor's abilities. 
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It is essential that the Student Model have the ability to 
recognize its own limitations. From observing various students and their 
interactions with the Wumpus Advisor, the author has noted that one very 
important step in the 6tudent’9 learning ie the development of a good 
representation for information about the warren. Without a good 
representation, it does not seem to be possible to advance to the more 
advanced phases. In all cases, the students have eventually developed a 
good representation on their own (the Wumpus Advisor does not currently 
have the ability to teach good ways to draw warrens). However, it is 
likely that some students would not overcome this obstacle on their own. 
While it is simple to add in the ability to graphically display "good 
warrens to the student, this is almost certainly only one of many 
possible pitfalls for the student that the Wumpus Advisor would not be 



to deal with. In such cases, it is essential that the Wumpus 


Advisor have the ability to monitor itself and determine when something 
has gone awry. 

The Initializations 

The Student Model is initialized according to information elicited 
from the user. Whenever a user starts up Wusor and the Wumpus Advisor 
does not already have a file on disc about him, the Advisor will ask the 
new user a series of questions which are used to initialize the Student 
Model. First, the Wumpus Advisor will ask the student 1) how old he is, 
2) how many years of education he has completed, and his general attitude 
to Math/Sciences. With this information, the Advisor will calculate the 
student’s Repetition Factor and Forgetfulness Factor under the assumption 
that older, better educated students who enjoy the Math/Sciences are 
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better prepared to learn about the Uumpus domain. Once the Student 
Learning Model has been calculated the Advisor will ask the student how 
many games of Uumpus (if any) he has played. This will be combined with 
whether or not the student chose to read the instructions and his 
estimated learning ability to select one of the four possible starting 
states for the Student Knowledge Model. From experimentation it has been 
found that the Advisor advances the Student Model more quickly and 
accurately than it degrades the Student Model (as increasing knowledge of 
the game is normal), and so the initial state of the Knowledge Model is 
generally a conservative estimate. In this manner the Student Model is 
initialized to a state which is hoped to be close to the actual state of 
the student. In any case, it is not of great importance that the Student 
Model be initialized with extreme accuracy as the Student Model has the 
ability to compensate for such errors. 
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Tutoring Strategies 
Chapter 6 

Model of Tutor/Student Interactions 
In Figure S.i there is a model of the interactions between a tutor 
and a student. It is intended to emphasize the cyclical natures of their 
communications. The tutor communicates his explanations to the student 
and the student communicates his understanding to the tutor (feedback). 
The tutor modifies his instructions according to the feedback and will 
elicit feedback if he is receiving insufficient feedback. Typically, the 
instructions are the explanations of a teacher, while the feedback 
includes everything from facial expressions (boredom, non-comprehension, 
etc.) to test scores. This model is of a generalized tutor which could 
be a human instructor or a computer program. 

Figure 6.1 ■ 

Model of Tutor/Student Interactions 



Feedback 


There are several limitations in the computer’s ability to 
communicate. This difference is primarily due to the fact that the 
computer's communications are restricted to the domain of typed 
responses. The computer can not rely on facial expressions or verbal 
signals to emphasize key points. Also, it can not use facial expressions 
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as a source of feedback. For example, it is inappropriate for a computer 
program to use jokes as a feedback eiicitor because the computer can not 
evaluate the response (laughter, snoring, etc.). ' 

In general, a computer program must rely on more formal avenues of 
communication for its instructions, feedback elicitors, and sources of 
feedback. 

Instruction 

The instructions can have various forms. These different 
approaches to teaching can be laid out along a spectrum which represents 
the amount of thinking required of the student. This is shown in Figure 
6.2. A point at the left end of the spectrum indicates that the tutor 
simply presents algorithms about the domain. This approach requires 
little thought on the part of the student and can be highly effective at 
quickly giving the student mastery of the field. ^ 1 ' * The opposite 
extreme is to simply present the problems to the student and state the 
solutions. This requires the student to work out a technique for solving 
the problems. This process is generally very slou and painstaking and 
leaves open the possibility that the student will never master the given 
problem domain. However, once the student has mastered the domain with 
the latter approach, he is very unlikely to forget the learned material 
and those lessons which the student learns in the given domain are more 
likely to be applied in other domains. 

( ii The idea qualifying the type of teaching according to the effort 
required of the student is presented in Winston (1970). 

tii) Of course this discussion presumes that the strategy selected will 
be developed in the best means possible. The approach of presenting 
algorithms to the student could be complicated by using very complex 
language, thereby requiring significant effort of the student just to 
comprehend the tutor. This is contrary to the intent of the algorithmic 
approach, and we do not consider such cases here. 
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Figure E.2 

Spectrum of Instruction Emphasis 


Teaching 

Algoritms 


Explaining 

Examples 


Giving 

Results 


Algebra 

Tutor 


A 

Wumpus 

Advisor 


Wumpus 

Game 


A 


In developing the Uumpus Advisor, it was decided that the Wumpus 
Advisor s instructions should be biased toward explaining results. This 
is shown in the spectrum of Figure S.2. The Wumpus game itself could be 
located at the extreme end of the spectrum (as shown in Figure E.2) as it 
simply supplies the player with results and the player is left on his own 
to develop better techniques for avoiding dangers. The disadvantage of 
the game without an Advisor is, as stated earlier, that the player can 




reach plateaus in his development and not advance to higher levels. The 
Uumpus Advisor generates explanations which are intended to lead the 
student to the accepted solutions. With this purpose, the most direct 
approach would be to present the student with a somewhat simplified 
version of the algorithms used by the Expert, but this approach does not 
seem to be conducive to teaching better ways to think about problems. 

The author has tutored various students in the game of Uumpus, and, 
when he presented the algorithms of the expert, the players quickly 
mastered the game, but there were also no indications that these students 
developed any improved thinking habits. However, when the author relied 
on the Uumpus Advisor and its explanations of the results (as opposed to 
the actual algorithm), he found that the learning process varied. Those 
players who were already logically oriented (such as M.I.T. graduate 


(T\ 
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students) quickly developed algorithms of their own. The players who did 
not possess the desired thinking habits (i.e. those who were not 
mathematically/scientifical ly inclined) found the ideas presented 
intriguing and rather difficult. They would put serious thought into the 
explanations and after many tries mastered the material. Comments were 
made such as "Oh wow, this is a completely new way of thinking.", but 
after having mastered the material they found it difficult to accept that 
the material was so difficult ("iut its so obvious "). (11 15 Because of 
such evidence, the Uumpus Advisor generally does not state its rules 
formally to the student but rather gives explanations of the results. 

In domains where the material to be taught is thought to have more 
intrinsic value (such as Algebra which, it can be argued, can be of use 
in every day chores such as balancing checkbooks), there i9 a better 
justification for teaching with a more algorithmic approach. A computer 
tutor for Algebra might factor several problems, showing all the steps to 
the student. Then it could give the student various problems to do on 
his own to help him remember the algorithms presented. This form of 
instruction is appropriate in those domains where the subject matter has 
Intrinsic value (rather than helping to develop improved thinking 
habits). (iv) 

Feedback 

It is also possible to lay out the different approaches to feedback 


f • ii) The above observations were made with about fifteen students in 
separate sessions, each of which lasted an average of three hours. All 
results were the subjective evaluations of the author. 

Socratic instruction (through leading questions) can be viewed as 
another approach to the problem of non-algorithmic tutoring strategies. 
Socratic questioning is discussed fully in Collins. 
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along a spectrum. Such a spectrum is shown in Figure 6.3. The point at 
the left end of the spectrum is for tutors which ask the student direct 
questions about the subject matter. The point at the right end indicates 
that the tutor estimates the student’s knowledge solely by observing his 
behavior. An Algebra tutor which presents problems to the student and 
observes how he solves them, would be toward the middle of the spectrum. 


Questions About 
Domain Knowledge 

I_ 


Figure 6.3 

Methods of Gaining Feedback 

Giving Problems 
Requiring Knowledge 


-*- 

Algebra 

Tutor 
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. . I 


Wumpus 

Advisor 


The methods used by a tutor to gather feedback are largely 
determined by the teaching environment and the subject matter that is 
being taught. The teaching environment restricts the options available to 
the tutor so that many techniques, such as jokes, are not available to a 
computerized tutor. Also, the subject matter lends itself to various 
feedback approaches. Geography is a field in which it is very difficult 
to evaluate the student’s retention of the material without directly 
asking questions concerning that knowledge. 

The Wumpus game is a domain which is conducive to obtaining 
feedback by observing the student’s behavior. This is true of most games 
environments as the game presents the student with problems to solve; an 


( V } A 

A geography tutor would be expected to ask questions such as, "What 
is the capital of Brazil?", which is a direct testing of the knowledge 
* hat 13 being taught. Just such a geography tutor has been developed bu 
Carbone I I and Collins in SCHOLAR. 
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Advisor need only observe the results. In such environments, it is 
important that the Advisor be careful to take full advantage of these 
observations and minimize the number of questions which it asks of the 
student. If the Advisor does resort to "testing" the student as a normal 
procedure, the student’s enjoyment of the game is likely to be greatly 
diminished, and he is likely to resent the ministrations of the Advisor. 
For these reasons, the Uumpus Advisor relies almost solely on 
observations of the student for its evaluations of his knowledge of the 
gatne. (vi) The only way in which the Uumpus Advisor can be said to be 
eliciting feedback is the manner in which it defines (and perhaps alters) 
the game in order to create environments which are more conducive to 
learning and the evaluation of the student. 

The above two spectrums are, of course, simplifications, but they 
help to emphasize two very important issues of tutoring strategies. 
Other issues that must be considered (and warrant further 
experimentation) are such issues as graphical versus verbal explanations 
and the types of instruction which are most effective (such as proof by 
contradiction, which is one the key methods of the Advisor for simple 
proofs). It is expected that Uusor III will be able to vary its tutoring 
strategies according to the student and will consider each of the above 
issues as well as other of such issues. These additions will increase 
Uusor*s Learning (lode I to the dimensions described in Goldstein’s 
proposal for the Coach Project (1977). 

Conclusion 


<vi) For the purpose of testing the validity of the Advisor’s deductions, 
a useful experiment is to ask questions of the student to determine the 
correctness of the Advisor’s estimation of his knowledge. 
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In designing Uusor II, there were several possible approaches to 
the problems of presenting the material and evaluating the student’s 
response. The Advisor presents explanations of the results of the 
algorithms, instead of the algorithms themselves, as this seemed to be 
more conducive to teaching better ways to think about problems. 
Likewise, it was decided that the feedback from the student should be 
gathered primarily by observing the students behavior as this would not 
detract from the students enjoyment of the game and would help maintain a 
positive feeling on the part of the student toward the Advisor. 
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The English Generation Routines 
Chapter 7 

The English used by the Uumpus Advisor is generated in a 
conceptually clear manner. For each of the routines/rutee of the Expert, 
there is a corresponding English routine/rule. If a routine of the Expert 
uses the results of a lower level routine, the corresponding English 
routine will likewise insert the text generated by the corresponding 
lower level English routine into its text. *** In this fashion, it is a 
quite simple to write short functional English generation routines. The 
problem with such an arrangement ie that the English thereby generated 
tends to be rather long and awkward. For this reason, various additions 
were made to the basic framework just described to improve on the output 
of the English routines. In the remainder of this chapter, we will 
describe those additions which were made to this simple approach. 

A very important addition to the English routines was to make them 
aware of what the player already knows. In this fashion, the English 
routines can omit those portions of explanations which the player is 
thought to already know. This is not extremely important in the earlier 
phases of the curriculum, but it becomes more important later on. As the 
Uumpus Advisor teaches progressively more advanced rules, it relies 
increasingly on the simpler rules which have already been taught. If the 
English routines do not omit the lower level explanations, the 
explanations become quite long (and it becomes extremely unlikely that 


*** The approach of having the explanation routines model the decision 
making routines is fully described in Stansfield (1375). 
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the student will read the explanation at all). The general solution to 
this problem is to have each English Routine first check to see if the 
player already knows all of the rules involved in its explanation. If 
so, then the English routine will not generate a complete explanation, 
but will instead only summarize its results. This simple check generally 
suffices to prune out the unnecessary parts of the explanations. 

Figure 7.1 


pit 


7 wumpus (?) 

Circled caves have been visited. 

Upper Case: Given Facts 
Lower Case: Deductions 

For example, a moderately advanced player was faced with warren 
shown in Figure 7.1 and he received the following explanation of why cave 
15 was very likely to contain bats (P13). 

One of caves 15 and 7 contains bats. Likewise, one oT 
caves 5 and 15 contains bats, and one of caves 16, 15, and 9 
contains bats. As you have multiple evidence for bats in cave 15, 
it is very likely that cave 15 contains bats. 

This explanation presumed that the player already knew all the rules 
necessary to derive cave-sets, so that the explanation routines simply 
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listed the cave-sets concerned rather than explaining each one. However, 
once the student model was modified to indicate no knowledge of the 
required rules, the following explanation was obtained: 

One of caves 15 and 7 contains bats. Cave 19 must be 
next to bats because we heard squeaking in cave 19. Hence one of 
cat/es 15, 7, 8, and 4 contains bats, but we have safely visited 
cave 4. Also, cave 8 can not contain bats because if bats were in 
cave 8 we would have heard squeaking in cave 4 as cave 8 is next to 
cave 4. This means that one of caves 15 and 7 contains bats. 
Likewise, one of cai/es 5 and 15 contains bats. Cave 0 must be next 
to bats because we heard squeaking in cave 0. Therefore one of 
caves 5, 15, 4, and 3 contains bats. However we have safely 
visited caves 3 and 4. This means that one of caves 5 and 15 
contains bats, and one of caves 16, 15, and 9 contains bats. Cave 
2 must be next to bats because we heard squeaking in cave 2, and so 

one of caves 16, 15, 9, and 3 contains bats, but we have safely 

visited cave 3. This means that one of caves 16, 15, and 9 
contains bats. As you have multiple evidence for bats in cave 15, 
it is very likely that cave 15 contains bats. 

The Uumpus Advisor would never have made such an explanation under normal 
circumstances (ii) , but this example does point out the importance of 

pruning the English for even moderately advanced explanations. Without 

pruning, explanations quickly become so long that the the overall thread 
of the explanation is lost. 

<ii) Normally the Curriculum would prevent the Advisor from explaining 
rule P13 before the student had thoroughly mastered all of the rules 
involved in explaining cave-sets, but for this example the Curriculum was 
disab Ied. 
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Another problem with the very simple model of the English routines 
is that such a model does not consider the idiosyncracies of the English 
language. For example, the English routines should not say "one cave 
away from" or "zero caves away from" (as was common in this paper), but 
rather it should say "next to" and "contains". There are a great many of 
such special cases (most of which center around the end conditions), and 
the English routines must have this knowledge programmed into them. Such 
problems can be handled with little deviation from the conceptual model 
if they are handled modular I y. ^ 

In order to finely tune the English routines so that its output 
consistently sounds natural, each English routine should be written with 
an idea of the context in which it will be used. An example of this is 
the amount of introduction given by an English routine for a lower level 
English routine s explanation. If the lower level routine returns only a 
short summary, then only a short introduction (if any) is warranted. In 
contrast, if the lower level routine returns a lengthy response, a longer 
introduction is warranted, and, in some cases, the entire explanation 
should be restructured. Such issues can be resolved relatively easily by 
considering the curriculum, and the overall hierarchy of the rules. It 
is unrealistic to presume that the English routines which explain complex 
probability rules will ever be called before the student has mastered the 
simpler probability rules. Uith this in mind, it is reasonably safe for 
the more complex probability routines to presume that the student will 
have thoroughly mastered cave-sets by the time they are called. This 




The solution to the problem stated above is simply a routine that 
converts "N caves away from" into its English equivalent. 
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reliance on the structure of the curriculum allows the context 
sensitivity to be static. 

The addition of various checks to the conceptual framework 
originally described has changed what would have been very simple 
routines into relatively long and complex routines. An alternative method 
of improving on the English generated by simple routines is to use a 
standard English Generation Front End as shown in Figure 7.2. The Front 
End would accept the relatively "rough" output of the English Generator 
and then improve on i t by performing such tasks as pronoun insertion, 
introduction modification, phrase replacement, etc.. This would have the 
effect of improving the overall quality of the English generated (as 
great effort has gone into developing such Front Ends) (iv) and would 
also make the task of generating English simpler. (Less knowledge of the 
English language would have to be programmed into the Advisor’s routines 
as that knowledge is already programmed into the Front End). 

Figure 7.2 

Diagram oj an English Generation System 
Arrows Represent Data 
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(iv) For more detailed information about English Generation Front Ends, 
see the work of McDonald, Novak, Simmons, Schank, etc. 
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Testing the Advisor 
Chapter 8 

The Wumpus Advisor is currently being tested to verify that is 
does, in fact, function as described. Also, a great many of the 
decisions which were made during the design of the Wumpus Advisor were 
based on the judgement of the author? these issues warrant further study 
to determine if the approaches taken were correct. Finally, it is 
important to determine whether the Advisor succeeds in teaching better 
ways to play Wumpus and, more importantly, better ways to think about 
problems. The tests can be divided into two general classes. They are 
local tests designed to test the ability of the Advisor to perform 
specific functions and global tests intended to test the overall ability 
of the Wumpus Advisor. 

Local Tests 

Local tests have been made with two different approaches to the 
problem. The first was a series of sessions in which students used the 
Wumpus Advisor in the presence of the author. In one such session, it 
was noted that negative logic (explaining the absence of warnings) was 
much more understandable than the approach then being used by the 
Advisor. Prior to this, the Wumpus Advisor would generate explanations 
like: 

Cave 0 can not contain the Wumpus because we did not smell the 
Wumpus in cave 2. This means that cave 2 is more than two caves 
away from the Wumpus, cave 1 is not next to the Wumpus, and cave 0 
can not contain the Wumpus. 

This (positive logic) seemed unnatural to students, and they responded 
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much better to the negative logic which is currently used by the Uumpus 
Advi6or. In the same situation, the Uumpus Advisor would explain: 

Cai/e 0 can not contain the Uumpus because, IT the Uumpus were 
there, we would have smelled the Uumpus in cave 2. 

There were approximately ten such sessions of roughly three hours 

duration, and, based on subjective evaluations of the sessions, it was 

determined that in general the Advisor functioned as desired, but it was 

not possible to determine how well the Uumpus Advisor would function in 

the wide range of situations it would eventually face. 

The next group of experiments to test the Uumpus Advisor’s ability 

to function as planned were conducted with a Synthetic Student. The 

Synthetic Student (Syndi) was designed to play the game of Uumpus in 

conjunction with the Uumpus Advisor (Uusor). Syndi is a perturbation of 

the Expert of Uusor with the main change being the addition of 

conditional statements which allow Syndi to play with some subset of the 

full Expert. Syndi has a Knowledge Model of her own and her Expert will 

not apply any rule unless the Knowledge Model indicates that she 

possesses knowledge of said rule at that time. Using Syndi, a great deal 

of debugging of Uusor was achieved with relative ease. Uusor’s analysis 

of the student’s move involves several inherent assumptions and Syndi was 

designed to test Uusor’s analysis given that these assumptions are true. 

The assumptions are: 

1. The student does not apply paradigms unknown to the Uumpus 
Advisor. 

2. Uhen faced with several equally desirable choices for moves, 
the player will choose that move which is closest (and hence 
easiest to get to). 


The idea of using a Synthetic Student to test an Advisor program was 
originally described in Goldstein’s Coach Proposal (1977). 
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3. The player is consistent in his application of rules. He does 
not have partial or incomplete understanding of a given rule 
(the undesirability of this assumption is decreased by the 
almost atomic rules of the Expert). 

4. The player does not make mistakes. Uusor’s relative caution is 
necessary because this assumption is obviously false. 

5. The player comprehends the explanations given by Uusor. 

Syndi was designed to satisfy all of the above assumptions. Also, as she 
had a distinctly diagnosable knowledge of the game at all times, it was 
quite simple to verify any results obtained by the Advisor. Several 
experiments were run with Syndi and Uusor, and they can be grouped into 
three types; 

1. Experiments in which Uusor initially presumed Syndi had no 
knowledge of the game and Syndi played with a constant 
knowledge state. Ideally, Uusor should advance his estimation 
of Syndi’s knowledge to exactly correspond to her actual state 
and not beyond. 

2. Experiments in which Uusor initially presumed Syndi had a 

complete knowledge of the game and Syndi played with a constant 
knowledge state. Ideally, Uusor should degrade his estimation 
of Syndi’s knowledge to agree with the actual state. 

3. Experiments in which Syndi started as a new player with no 

knowledge of the game, and "learned" rules as Uusor explained 
them to her. Ideally, Syndi’s knowledge should increase to 

complete knowledge of the game with Uusor’s estimation lagging 
a little behind each change. 

These experiments are relatively complete and thoroughly test the 

Advisor s functioning given that the above assumptions are true. In 
experiments of Type 1 (advancing to a fixed knowledge state), it is 
acceptable to always start with Uusor presuming Syndi has no knowledge of 
the game because Uusor must advance his estimation through all of the 
phases up to the desired phase to achieve the desired result. This also 
justifies starting all Type 2 experiments (degrading to a fixed knowledge 
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state) from an initial estimation of complete mastery of the game. 
Experiments of the first two types serve to test the existence of 
situations which will allow Uusor to make deductions as well as testing 
the correctness of said deductions. The third type of experiment is 
necessary to insure that there are situations which allow Uusor to tutor 
a student from the very basics to the most advanced skills in a moderate 
amount of time. 

Uhen conducting the above experiments, several modifications were 
found necessary. Hany of the experiments with Syndi did not achieve the 
desired results on their first run, which required that Uusor be modified 
in light of the preliminary results. Then the process would be repeated. 
In this section, we will discuss the experiments which were run, the 
problems which arose, the solutions which were found {if any), and the 
final results achieved. All this will be in general terms with the 
specific results achieved shown in Appendix F. 

The first experiments were run with Syndi having a complete 
Knowledge of the game and Uusor initially presuming that she had no 
knowledge of the game. It was found that Uusor advanced the student 
rapidly with respect to the Uumpus domain, but unreasonably slowly with 
respect to the bat/pit domain. Because of this, several modifications 
were required before Uusor would correct its estimation of the student s 
knowledge to complete mastery of the game within a reasonable amount of 
time. The first problem was that Syndi would risk bats far more 
regularly than pits (as bats are not necessarily fatal) so that Uusor had 
very few chances to observe the application of the probability rules with 
respect to pits. This required the modification of the student modelling 
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components so that the bat/pit Familiarity Values would be associated 
together once Uusor had determined that the player had mastered the rules 
to determine which caves are safe (i.e. Uusor would presume a 
transference of bat/pit knowledge once the player had mastered Phase 1). 
The next modification was required because there were often long periods 
in which there were no situations which would allow Uusor to deduce a 
mastery of probability rule Pll for bats/pits. In order to overcome this 
.difficulty. Uusor was given the ability to modify the initial game 
situation to create ideal situations appropriate for a player of the 
given phase. These ideal situations are described in Appendix C and 

illustrated in Appendix D. The desired results uere achieved by combining 
the above modification with another modification! the criteria for 
deducing that the player had mastered a given skill was lowered somewhat. 
Previously, the situation had to have been such that the selected move 
was better than some other move with respect to a single danger and 
identical (no differences) with respect to the other two dangers. Then 
Uusor would note that the player had given the appearance of mastery of 
those rules which were required to justify the superiority of the better 
cave for the relevant danger. This was relaxed so that, if there were a 
single deciding danger which made the selected move superior, the two 
moves had to be one of the following with respect to each of the two 
remaining dangers; I) identical, 2! such that the player should have 
perceived that they involved the same risk, or 3) such that the player 
should have perceived that the better move involved a greater risk. 
These two modifications, along with fixing various outright bugs, allowed 
Uusor to advance the student to complete mastery of the game in roughly 
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thirty games (a figure which is considered acceptable for a tutor 
designed to work, with children). 

The next series of experiments were such that Syndi used a fixed 
subset of the expert’s knowledge and Uusor started with an initial 
estimation of no knowledge of the game. In these experiments, Uusor 
advanced rapidly to the actual state (as it did before), but rather than 
stopping, it continued to advance the student slowly, albeit surely. 
This required the addition of the more complete analysis for 
probabilities described in Appendix B. Of course the original 

experiments had to be run over again, but it was found that in all cases 
Uusor would advance to the correct knowledge state and then make no 
further presumptions of the student's knowledge. The advances proceeded 
at an acceptable rate. The only non-optimal performance by Uusor was due 
to problems in implementing Syndi. In developing Syndi it was found that 
it was necessary that she have some minimal knowledge of the game at all 
times in order to model even beginning players. The required knowledge 
was an understanding of what it meant to have encountered a danger, what 
it meant to have safely visited a cave, and a rudimentary understanding 
of what it meant to shoot into a cave. The problem was that the 

rudimentary understanding of the shooting principle (P15) would 
occasionally cause Uusor to advance the student slightly beyond the 
actual knowledge state, assuming a full mastery of rule P15 and the rules 
associated with it when, in fact, there was only a partial mastery (i.e. 
Assumption 3 was violated). 

The next set of experiments were of the Type 2 described earlier. 
Syndi was started with a fixed, but limited knowledge of the game, while 
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Uusor initially assumed that she had complete mastery of the game. The 
ideal was that Uusor would degrade his estimation of Syndi’s knowledge 
until it corresponded to the actual knowledge and then make no further 
change. Several experiments were run with different knowledge states for 
Syndi. Many of the same problems that were encountered in advancing the 
student were encountered in degrading the knowledge estimation. It was 
found that there were not sufficient game situations which would clearly 
reflect on the application of rules Pll for bats/pits (as in advancing 
the student for the same rules). The problem was that the ideal 
situations which had been introduced to aid in the advancement of the 
student were always one level more advanced than would be necessary to 
degrade the student’s knowledge model. Fortunately the implementation of 
the ideal situations had been such that Uusor was only able to create an 
ideal situation approximately one fourth of the time (ii) , and this 
allowed Uusor to try to create an ideal situation for a student of the 
next lower level (and the next more advanced level) if it failed to 
create the ideal situation for the given level. This was sufficient to 
overcome this problem. 

Another problem that was encountered when running the Type 2 
experiments was that Syndi sometimes would not risk pits often enough to 
allow Uusor to analyze her knowledge of the pit rules. This problem had 
been encountered earlier when testing Uusor’s ability to advance the 
student and had been solved by assuming that knowledge in one of the 
bat/pit domains would be transferred to the other domain. A solution to 

This was done to restrict the possibility of the student 
that Uusor was modifying the game. 


deducing 
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the problem would have been to completely tie the two domains together 
with a resulting lack of flexibility (since Uusor would not be able to 
diagnose failures to transfer knowledge from one domain to the other). 
It was decided that the decrease in flexibility was not warranted in 
light of the fact that decreasing Uusor’s estimation of the student’s 
knowledge was not the normal mode in which Uusor would function. This 
same trade off came up when it was noted that Uusor would sometimes 
(though rarely) note that the player had demonstrated a knowledge of 
rules which the student did not, in fact, possess. This was caused by 
the reduced requirements for demonstrating knowledge of a rule which were 
implemented while running the Type 1 experiments. In particular, because 
Uusor assumed that the student would recognize that a move was worse 
because of a certain danger (and in this case that was a false 
assumption), and therefore conclude that he he must have seen the virtues 
of the move with respect to another danger (since he chose the move). 
This was determined to be a minor problem because the problem was very 
unlikely in normal situations (since Uusor would not be that far off in 
his estimation of the student’s knowledge) and because Uusor eventually 
corrected for the error. To return to the more stringent requirements 
would decrease Uusor's ability to advance the student (his normal mode) 
without significantly affecting Uusor’s ability to decrease his 
estimation of the student’s knowledge. 

The above two sets of experiments showed that Uusor was completely 
accurate in its ability to increase its estimation of the student’s 
knowledge to the appropriate level (and not beyond) when its intrinsic 
assumptions were valid. It was also found that, when the assumptions 
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were violated (in particular, the assumption that the student would not 
have partial or incomplete understanding of a rule), Uusor could diagnose 
the general level of the student, though its estimation of the student’s 
knowledge of particular rules was not always reliable. It was also found 
that Uusor had a limited ability to decrease his estimation of the 
student’s knowledge, though this mode was not nearly as reliable. In 
light of these results, Uusor’s initializations of the student’s 
knowledge were always kept on the conservative side. (iii) Having 
verified that Uusor’s analysis of the student’s actions were working, it 
was then appropriate to insure that Uusor could give advice on all the 
different aspects of Uumpus play. This was to insure that, if the 
student could understand Uusor's explanations, it would be possible for 
him to become an expert in Uumpus play without any outside help (i.e. 
there were not any holes in Uusor's curriculum). 

The final set of experiments were of Type 3, in which Syndi started 
with a minimal knowledge of the game, but would learn each rule as soon 
as it was explained to her. After a certain level, Syndi would begin 
transferring knowledge between the bat and pit domains. Uusor started 
with a correct estimation of Syndi’s ability. Of course, Syndi’s ability 
to learn is rather artificial. Uhen Uusor notes that he has just 
explained a rule to the student, Syndi will be so informed as well 
(thereby allowing her to begin applying the rule in question). Another 
technical problem was to decide how Syndi should modify her behavior when 




The claim.that Uusor’s initializations of the Student Model tends 
to be conservative is justified by the fact that, in real usage, Uusor 
typically starts the student at a certain level and quickly advances the 
student to a more appropriate level; after which 
more slowly. 


advances tend to come 
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she first learned a rule. Should she recalculate her entire data base or 
change it ae her attention was called to It. This was very important as 
human players do not take either extreme (but, instead, do a little bit 
of both). The general tack that was taken was to emulate a human player 
a 9 much as possible. For example, if a player did not previously 
understand the significance of warnings/nouarnings, it is unlikely that 
he would have kept a record of this information. When he did learn their 
meaning, he would have to revisit a cave before he would able to use the 
information of its warnings. So it was with the data base with Syndi. 
This is not true of the more complicated rules as they simply involve 
better usage of the available information. This aspect of human players 
was also modelled in Syndi. 

In the Type 3 experiments it was found that Uusor was able to find 
situations to advise the student for all levels of knowledge of the 
bat/pit domain (again, this was largely due to Wusor’s ability to modify 
the game to create ideal situations for bat/pit knowledge), but Uusor was 
not able to ever advise the student with respect to the rules of Phase 2 
of the Uumpus. This was because of a bug which made Uusor unnecessarily 
restrictive in the situations where it would advise the player about 
these rules. (iv) This bug was fixed, and Uusor then began explaining to 
Syndi the rules of Phase 2 for the Uumpus. Once this obstacle was 
overcome, it was found that the situations in which Uusor would explain 
the rules of Phase 3 for the Uumpus were extremely rare. This problem 
was fixed by creating a new Combination Rule, Combination Rule Seven. The 


The bug was mutually exclusive requirements prior to the explanation 
of Combination Rule 5. 
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above two modifications allowed Uusor to advance Syndi from the most 
basic levels to the most advanced levels by first explaining a rule and 
then observing when Syndi applied the rule. In Appendix G there is the 
first part of the protocol taken in a Type 3 experiment. It is indicative 
of what all the protocols look like. 

The last three of the five assumptions are most questionable. The 
above experiments indicate that Uusor functions a 9 desired even when the 
third assumption (the assumption about incomplete understanding of rules) 
is violated. The fourth assumption that the player does not make 
mistakes is obviously invalid, but, because of this, Uusor was designed 
to be cautious in its conclusions (thereby decreasing the detrimental 
effects of mistakes). The remaining questionable assumption is the 
assumption that the player understands Uusor’s explanations. There is 
evidence that Uusor is, in fact, effective at tutoring students in his 
game strategies. Uhile the above experiments were being carried out, 
Uusor was accessible to the general community at the Artificial 
Intelligence Lab so that anyone who was interested could play with Uusor. 
Roughly 15 students played with Uusor extensively with reasonable 
results but there was one disturbing anomaly. It was known that 
there were more situations which allowed the student to demonstrate 
knowledge of the Uumpus domain because Uusor advanced his estimation of 
Syndi in this domain much more quickly than in the bat/pit domain. 
However, when Uusor was being used by actual students it was observed 
that knowledge of the bat/pit domain far exceeded the Uumpus domain 


This is the subjective evaluation of the author based on his study of 
the complete protocols kept by Uusor. 
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beyond Phase 1. It was feared that this indicated that Uusor assumptions 
were not valid, but then it was discerned that Uusor was unable to tutor 
students on the more advanced Uumpus rules. This, in itself, is 
indicative that students do tend to adopt those strategies explained by 
Uusor, though stronger evidence will be gained if students advance more 
quickly in the Uumpus domain now that Uusor can advise them in these 
ski I Is. (vi) 

When a large tutorial program, such as Uusor, is designed with such 
varied behavior, it is extremely difficult to determine whether or not 
the program functions as desired. However, this testing can be vastly 
simplified through the use of a Synthetic Student. Syndi served to test 
Uusor in closed experiments where all variables could be controlled 
(Syndi never had a bad day). Of course, experiments with Synthetic 
Students do not conclusively show the effectiveness of a tutor, but they 
do serve as a wonderful debugging aid. Uith a Synthetic Student, it is 
possible to determine whether or not the program functions as desired; 
the determination if the correct approach was taken has to be determined 
through other experiments which are of a more global nature. Ue will go 
on to describe these global experiments in more detail. 

Global Testing 

The Global Testing is the next step in testing the Uumpus Advisor. 
One of the goals of this phase of experimentation is to test out the 
relative merits of different tutoring strategies, such as graphical 
versus verbal. However, the main goal of these experiments is to 


(vi) The reader can also verify that Uusor's explanations are 
comprehensible by studying the scenarios in Appendix E and Appendix G. 
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determine how well the Uumpus Advisor actually teaches better game 
playing (if it does, in fact, help students master the game). The 
experiment would probably be a rather simple arrangement involving a 
total of roughly ten students. Five of these students would be allowed 
to play the game of Uumpus (as modified in this paper) without the 
Advisor. The remaining five would be allowed to play the game with the 
advice of the Uumpus Advisor. At the end of the testing period (which 
would almost certainly extend over a period of about two weeks to a 
month), the students would be tested on their understanding of the game. 
There would be the objective results of the percentage of games won and 
lost when the students were allowed to play without any help, but there 
should also be subjective results. The students should be tested on 
their ability to make deductions, their ability to estimate dangers, and 
their general understanding of the game. It is expected that the 
students who played with the Advisor will not only play better (as 
indicated by their percentages of wins and losses) but also have a better 
understanding of the game. 

The final experiment is to test the Uumpus Advisor’s ability to 
teach better ways to think about problems. Only a genera! outline of the 
experiment will be given as the details will be dependent on the results 
of the previous tests. The experiment should involve about 150 students 
to allow for statistical evaluation. At the start, all 158 students will 
be given a test that measures their ability to think about problems 
(probably an I.Q. test with emphasis on thinking ability). After the 
test, the students will be separated into homogeneous groups of 50. The 
f irst group of 50 students wi I I be left alone for six months. The second 
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group will play the game of Wumpus without the Uumpus Advisor. The last 
group will play the game with the Uumpus Advisor. The experiment will 
continue for six months with the active students playing Uumpus for a 
specified period each week. Finally, all the students will be given a 
variation of the original test. In this manner, it should be possible to 
test both the game of Wumpus and the Advisor in their ability to teach 
better thinking habits. 

The large size of the groups to be tested is essential as it is 
doubtful that any one force can be that significant in improving thinking 
habits. With a smaller sampling size it is possible that the small 
increments in thinking ability would be lost in the normal noise that 
accompanies such tests. It is also possible that the results would not 
be conclusive even with such a large sampling, but there is no way to 
predict this. A significant number of the subjects in the standard 
curriculum of the public schools are justified by the hope that mastery 
of the given field will help the student develop better ways to think 
about problems. However, no one seems to have made a significant effort 
to verify these claims of improved thinking habits, and so it is very 
difficult to estimate the sample size which will be necessary to arrive 
at conclusive results. The proposed test should help to clarify how large 
a sample is required to arrive at conclusive results in such a test. 

Conclusion 

The Uumpus Advisor has been thoroughly tested to debug it and fine 
tune its tutorial ability. Concurrently, consideration was given to the 
various programming decisions which were made according to the judgement 
of the programmer due to the lack of concrete evidence. The Advisor has 
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been determined to be "ready," testing is in progress to determine the 
Advisor’s ability to teach better game playing. This experiment will be 
on a relatively small scale because it is expected that the results will 
be conclusive in any case. However, when the Wumpus Advisor is tested on 
its ability to actually teach better ways to think about problems, the 
results are not expected to be as obvious. For this reason, a larger 
sample size is necessary if the experiment hopes to be conclusive. 
Finally, based on the results of these experiments, further work leading 
to Uusor III could be warranted. 
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The Conelusion 
Chapter 9 

A new generation of Computer Aided Instruction (CAI) is evolving 
which transcends past limitations by using Artificial Intelligence (AI) 
techniques. These new programs are AICAI programs since they incorporate 
the concepts of Artificial Intelligence while maintaining the goals and 
objectives of Computer Aided Instruction. AICAI is a field which 
contains many interesting and challenging problems, the Uumpus Advisor 
being just one example. 

The Uumpus Advisor applies many of the techniques developed in 
Artificial Intelligence. It centers around a rule-based Expert which 
uses heuristics to arrive at results that are both reliable and 
understandable to humans. The rules of the Expert are organized into a 
curriculum which is taught by the Uumpus Advisor as appropriate. The 
rules of the Expert are also the foundations for the Student Knowledge 
node I. The Student Learning Model interprets the Student Knowledge Model 
and determines which rules the student is presumed to have learned and 
which rules are acceptable for teaching. In making these determinations, 
the Student Learning Model makes extensive use of the Student Learning- 
Vector. The Expert’s rules are also the basic framework around which the 
English generation routines are built. The fact that both the Student 
Model and the English Generator are designed around the rules of the 
Expert allow the English routines to prune their English according to the 
student’s knowledge. The logical grouping of the Expert’s rules in the 
curriculum also allows the English routines to be context sensitive 
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without lengthy computations. The essence of the improved Wumpus Advisor 
is the different rules of the Expert and their interreI ationships. 

The main limitations of the original Wumpus Advisor were due to the 
lack of a clear understanding of the knowledge to be taught. Once this 
knowledge had been broken down into specific items, the development of 
the remainder of the Wumpus Advisor was relatively straight forward. In 
more interesting problem domains (such as chess), it will probably not he 
as difficult to develop the rules for the expert, because the problem 
domain will have been analyzed before the program is begun. The main task 
will be to break down the knowledge of the problem domain into rules. 
This will allow more effort to be directed into those areas which are of 
more theoretical interest (such as the Student Model). 

The next logical step beyond the Wumpus Advisor ui I I be a program 
that teaches skills in a problem domain that can not be analyzed with 
absolute certainty. An important asset of the game of Wumpus was that i t 
was possible to discern a single best move and explain why. In problem 
domains such as chess, it is not possible to know absolutely the quality 
of most moves. This would handicap an advisor since there would always be 
the possibility that the expert would not give correct advice. However, 
this is a handicap shared with human teachers. An advantage of more 
complex problem domains is that the Advisor will be able to teach 
principles and concepts which can not be represented within the limited 
domain of Wumpus. For example, the formal methods of decision theory 
could not be taught effectively within the Wumpus domain, but it could be 
taught in more complicated domains. An advisor which functioned in a 
more complicated domain could be built along the general lines of the 
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Wumpus Advisor, but its Student Model would require significantly more 
effort. An advisor for a more complicated domain would need to be more 
flexible in its explanations to the student since the advisor would not 
ba as certain of the results of its expert. The primary source of this 
increased flexibility would be the Student Model. 
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Modular Diagram of the Wumpus Advisor 
Appendi x A 

Each sub-module of the Wumpus Advisor has a unique prefix which 
identifies all the routines of that sub-module. These prefixes are used 
to identify the sub-modules in the flow chart. Each sub-module performs 
a particular task which is described in the remainder of this Appendix. 

The Wumpus Executive 

UE- is the prefix for the Wumpus Executive routines. 

WG- is for the Wumpus game. 

The Wumpus Advisor 

UA- is the prefix for the Wumpus Advisor routines. 

UAW- is for the routines of the Wumpus Advisor which keep track of 
the route taken by the player. 

WAD- is for the routines which compute the distance to different 
caves from a given starting point. They also compute routes, etc.. 

UAN- is for the Wumpus Advisor routines which modify the game and 
create the various starting situations. 


The Expert 

XX- is for the executive for the Wumpus Expert. It activates the other 
sub-modules and then combines their results, 

XD- is the prefix for the routines which maintain the data base for 
the Expert. They also compute the "MORE-THAN" and "EXACTLY" values. 

XS- routines keep track of the different cave-sets, noting which 
are complete, etc.. 

XR- routines identify those caves which must be safe based on 
global consideration of the cave-sets for a particular danger and the 
number of the dangers in the warren. 
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XP- is the prefix for the routines which compute the probabilities 
for the different dangers. 


The Move Comparer 

CX- is the prefix for the Result Explainer routines. They return 
the rules which are necessary to justify the results of the Expert. 

CM- is for the Explanation Comparer, which compares the rules 
returned by the Result Explainer and determines how familiar the student 
is with the rules in question. 


The Psychologist 

PS- is the prefix of the Psychologist. 


The Student Model 

SK- ie the prefix of the Student Knowledge Model. These routines keep 
j iO track of the student’s knowledge of the different rules. 

SL- is the prefix of the Student Learning Model. These routines estimate 
how quickly the student learns material, forgets same, etc.. 

SC- is the Student Model Critic. It adjusts the Student Learning Model 
as appropriate. 

SF- routines maintain the disc files on the different users. 


The English Generation Module 

EX- is the prefix for the routines which generate English to explain the 
results of the Expert. 

EG- routines generate clauses in English that convey a particular 
concept (such as "N caves away from"). 


Utility Routines 


GP- is the prefix for 
abilities of LISP. 

G- routines are 


general purpose utility functions which expand 
the I/O routines of the Wumpus Advisor. 


the 
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GQ- routines query the student uith standard questions and return 
his response once it is in an acceptable form. 

Gil- are general purpose routines uhich do mapping (such as a 
routine to do MAPC non-destructive I y). 

GC- routines do the processing of the circular list uhich are used 
in English generation and FIFO stacks. 


PLAYER is an extraneous variable in an otherwise conceptually clear 
framework. This anachronism will be eliminated as soon as possible. 
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Hodular diagram of the Uumpus Advisor 
Dotted lines represent significant flows of information. 
Solid lines illustrate the hierarchy of control. 



General Purpose 
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Move Comparisons 
Append i x B 

The Hove Comparer is responsible for taking two moves and 
justifying their relative merits with respect to a single danger with 
specific rule numbers. For this purpose there are two separate sub- 
modules to perform this task. These two sub-moduleB are the Result 
Explainer and the Explanation Comparer. First, the Result Explainer 
determines which rules were used by the Expert to arrive at a specific 
result. Then the Explanation Comparer takes two results and compares 
the different rules involved. In this fashion, the Hove Comparer can 
return the rules which justify the difference between two possible moves 
with respect to a single danger. The Move Comparer also performs various 
other functions; primarily, the Explanation Comparer consults the Student 
Learning Model and determines if the student knows the rules in question, 
are they acceptable for teaching, etc.. This is a cursory overview of 
the process, but we have overlooked many of the problems of such 
comparisons. In the remainder of th i s Appendix we will look at these 
issues more closely. 

Uhen the Uumpus Advisor is comparing two different moves it must 
occasionally consider which of the probability rules is involved. This 
is true if the Uumpus Advisor is analyzing the student’s move to update 
his Student Model, or if the Uumpus Advisor is determining if it can 
explain a particular good move to the player. In any case, the Move 


( 1 ) In this light, the Result Explainer is structured almost exactly like 
the Expert. 
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Comparer must be able to decide which probability rules are applicable in 
different situations. 

The Uumpus Expert only considers the most accurate probability 
rules in its analysis, but the Uumpus Advisor must be able to determine 
when less advanced probability rules are involved in a particular 
comparison. For example, in Figure B.l there is a common situation that 
is presented to moderately advanced players. Cave 8 is very likely to 
contain a pit because of the double evidence of a pit (i.e. the breezes 
which were felt in caves 3 and 12). This makes it rather unlikely that 

either of caves 4 and 18 contain a pit since a pit in cave 8 would 

explain the breezes in caves 3 and 12. The rule which justifies the high 
probability of a pit in cave 8 is P13. The rule which justifies the low 
probability for caves 4 and 18 is rule P14. Now, if the Uumpus Advisor 
were comparing the two probabilities for caves 4 and 8, it might note 

that rules P13 and P14 were both involved in the comparison. However, a 

student could know only rule P13 and not rule P14, and he would still 
choose the better move. Correspondingly, the Uumpus Advisor could 
explain why cave 8 was a bad move by simply explaining how P13 applied. 

Figure B.l 


8 
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Such situations require that the Uutnpus Advisor (in particular, the 
Move Comparer) make a detailed analysis when two probabilities are being 
compared. For this purpose, the Uumpus Expert computes the probability 
for every cave with respect to each probability rule. It then chooses 
the prefered probability (the most reliable) and saves all of the 
different probabilities. The first step of the Wumpus Advisor’s analysis 
is to insures that one cave actually does have a lower risk than the 
other move. * 1 ' ^ It then determines which of the following cases applies 
by checking them in order. 

A) If the better cave (Setter) is safe, then the only applicable 
rules are the rules which make Better safe. This is because 
the presumption is that caves are moderately dangerous unless 
there is some evidence indicating that they are safe. 

8) Correspondingly, if Worse is certain to contain the danger, 
then the only applicable rules are the rules uhich show that 
Worse contains the danger (normally L0). This is because the 
presumption is that Better is moderately unsafe. 

C) If both of the above two rules are applicable, then choose the 
simpler of the two explanations (least advanced) as the 
necessary explanation, 

0) If the comparison of the two Pll probabilities will justify the 
quality of the Better move, then Pll is the only applicable 
ru I e. 

E) If P12 is applicable for the Better cave’s probability, then 
P12 is th^ only rule that is applicable. 

F) If the P13 probability for Worse is greater than the Pll 

probability for Better, then Pll and P13 are the appropriate 
probabiIity rules. 

G) If the P14 probability for Better is less than the Pll 

probability for Worse, then the probability rules which are 
applicable are rules Pll and P14. 

H) If none of the above apply, then use the simplest probability 


(ii) This comparison is based on the "preferred" probabilities of the 
Uumpus Expert. 
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rules which will justify the "preferred" probability.^' 1 '^ 

In this fashion, the Uumpus Advisor determines the simplest possible 
rules which can justify the difference between two caves. This 
information is used in analyzing the student’s move as well as in 
determining whether or not the better move is suitable for explaining for 
the student. Once the Uumpus Advisor has discerned that a move can be 
explained to the student, it then determines the best rationale to 
explain to the student. In the example above, the Uumpus Advisor might 
decide to explain to the student that cave 4 is a better move. Depending 
on Uusor’s estimation of the student’s knowledge, Uusor would choose to 
explain either rule P13 or rule P14. Uusor’s explanation would center on 
rule P14 if the student already knew rule P13, and the explanation would 
center on rule P14 if the player was not yet advanced enough to allow the 
explanation of rule P14. 

Once Uusor has decided to explain the comparison between two moves 
and has selected rationales to be explained, it must then create an 
explanation with the appropriate emphasis. The points of emphasis of 
such explanations are shown in Table B.2. 

Table B.2 


Rationale For 
Better Cave 

Rationale For 
Uorse Cave 

Emphasis of Explanation 

Rule Pll 

Rule Pll 

The Cave-Sets involved 

Rule P12 

Not Rule P12 

The explanation of P12 

Rule P14 

Not Rule P14 

The explanation of P14 

Not Rule P13 

Rule P13 

The explanation of P13 

Rule P15 

Any Other 

Not Rule P15 
Any Other 

The explanation of P15 
Both Probabi1i ties 


111 If a cave is only a member of one cave-set, then the 
probabilities for Pll, P13, and P14 will be equal in most cases. 
Therefore, Pll is applicable for that particular cave. 
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Uusor uill generate the explanation which first fits. Examples of some 
of these explanations are in Appendix D. 
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Analysis of Game ModiFicatlons 
Appendix C 

Before the start of a game, the Uumpus Advisor Mill attempt to 
modify the game to create a situation which is thought to be a 
challenging problem to the student. This is done by looking for a 
starting cave within the warren which meets all the requirements of the 
situation desired. Typically, the requirements will involve the warnings 
at the cave in question as well as the warnings in one of its neighboring 
caves. If a cave is found that meets all of the requirements, the Uumpus 
Advisor will move the starting cave to the desired cave. Then, the 
player will be informed about his starting location (the one selected by 
the Uumpus Advisor) and what the neighboring caves are that he can move 
to. Once the player has selected a cave (at random since he does not 
have enough information to choose wisely), Uusor will transpose the caves 
of the warren so that he moves to the neighboring cave selected by the 
Uumpus Advisor. The requirements for starting situations vary for 
students of different phases. These requirements are shown in Table C.l. 


Tab Ie C.1 


Phase 

Starting Cave 
Requirements 

Neighbor* s 
Uarning 

Requirement On 
Overa11 Situation 

Teaching 
Point 

0 

— 

No Uarnings 

Some Uarning 

None 

Backtracking 

i i 

Bat or Pit Uarning 

Other Uarning 

Neighbor in Common 

L3 and L5 

1 2 

I 

Bat or Pit Uarning 

Same Uarning 

Smaller Cave-Set 

Pll 

3 

Bat or Pit Uarning 

Same Uarning 

Neighbor in Common 

P13 

1 A 

1 ^ 

No Uumpus Uarning 

Any Uarning 

None 

P13 (Uumpus) 
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Examples of each of these situations are shown in Appendix D. There are 
not acceptable starting caves in many warrens, so that the Uumpus Advisor 
only creates these situations roughly half of the time. This, when added 
to the fact there are various possible starting configurations for each 
phase, should be sufficient to prevent the student from ever recognizing 
the similarities of the starting situations before he is in a new phase. 
(iv) However, even when the Uumpus Advisor does not modify the game to 
create an ideal situation, it will transpose the player’s first move if 
it is to a cave containing a danger, thereby preventing the player from 
ever losing on the first move (which can have very bad effect on morale). 
This ability to transpose moves can easily be extended to give positive 
or negative reinforcement as necessary. 


Thi8 is a hypothesis which has not yet been verified through 
experimentation. 
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Examples of Ideal Situations and Probability Compa 

Appendix D 

This appendix contains examples of the "ideal situations" that were 
created for students in different phases along with the corresponding 
explanations. The requirements for the situations are given in Appendix 
C and the structure of the explanations i9 given in Appendix B. These 
are actual situations created by the Uumpus Advisor and the actual 
explanations given. 

A beginning player (Phase 0) who is being taught the rules would be 
faced with a situation very similar to this. 

Figure 0.1 



This situation would give rise to the following explanation if the player 
asked to move to cave 18. 

Brian, did you know that we can backtrack to caves that we 
have already visited? Yes 

Oh,... UeI I then why not go back to cave 3 where we 
didn’t get any warnings at all. From there we can safely go 
to cave 13 without risking the bats. It is not wise to visit 
caves with bats because while THEY will not harm us they will 
carry us to another cave which could contain a pit or the 


Page 95 












The Uumpus Advisor 


Appendices 


Wumpus. 

Would you like to go to cave 18 anyway? 

A player who was slightly wore advanced (Phase 1) and presumed to 
have gotten a basic understanding of the ruleB, would be faced with the 
following situation. 

Figure D.2 



Thie situation would give rise to the following explanation if the player 
asked to wove to cave 0. 

Brian, we don’t need to risk bats in cave 0 as cave 
16 is safe. Cave 16 can not contain bats because, if bats 
were there, we would have heard squeaking in cave 11. It 
is best to avoid bats as they could carry us to a fatal 
cave. 

Would you like to go cave 0 anyway? 

A player who is presumed to have mastered the rules necessary to 
discern which caves are safe (Phase 2), would be faced with a situation 
similar to this. 
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Figure D.3 



This situation would give rise to the following explanation if the player 

asked to move to cave 18. 

Brian, it isn’t necessary to take such large risks 
with bats. It is true that one of caves 0, 12, and IS 
contains bats because we heard squeaking in cave 1, but one 
of caves 18 and 11 contains bats because ue heard squeaking 
in cave 10. This makes it less likely that cave 0 contains 
bats. Therefore, Brian, we might want to explore cave 0 
i nstead. 

Would you like to go cave 18 anyway? 

A player who has mastered the simple probability rules would be 
given a situation like this. 
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This situation would give rise to the following explanation if the player 
asked to move to cave 3. 

Brian, it isn’t necessary to take such large risks 
with pits. One of caves 3, 7, and 8 contains a pit. 

Likewise, one of caves 3, 2, and G contains a pit. This 
is multiple evidence of a pit in cave 3 which makes it 
probable that cave 3 contains a pit. It is less likely 
that cave 8 contains a pit. Hence, Brian, we might want 
to explore cave 8 instead. 

Uould you like to go to cave 3 anyway? 

In the course of playing a game, a player of this level could be 
faced with a situation like this. 


Figure D.5 



This situation would give rise to the following explanation if the player 

asked to move to cave 16. 

Brian, it isn’t necessary to take such large risks 
with bats* It is true that one of caves 18, 0, and 9 
contains bats, but cave 9 contains bats because we were 
picked up by bats when we visited it before. This explains 
a I I the evidence for bats in caves 18, 0, and 9. As we do 
not have any evidence of cave 0 containing bats, we can 
presume that it is unlikely that cave 0 contains bats. 

There is evidence of bats in cave 16 which makes it a more 
dangerous cave. Therefore, Brian, we might want to explore 
cave 0 instead. 

This is an explanation of rule P12 for bats as described in 
Appendix B. 
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Sample W urn pus Advisor Scenario 
Appendix E 

In the following scenarios, both Uusor’s statements and the player’s 
responses will be indented to the actual fields used; the player’s 
response will be underlined. These scenarios have been constructed by 
combining advice actually given by Uusor, but they are not realistic 
because Uusor would never give advice so often and because there is not a 
consistent Student Model throughout. This Appendix is intended to 
give the reader some idea of Uusor’s ability to give advice in a wide 
range of situations. In this regard, it is not all inclusive since Uusor 
has given advice in many situations far removed from the ones shown. 

Hello, my name is Uusor VII. Uhat is your name? JIM STANSFIELD 

Do I have your name right, Jim Stansfield? Yes 

Uould you like instructions? No 

Just a second while I erase my blackboard. 

In this game, Jim, there will be 23 caves all total, three 
pits, and three caves with bats. 


In fact a new player would not be put in a warren with so many bats and 
pi ts. 

O.K., now I will draw up a new warren. 

Ue are now at cave S. 

The neighboring caves are caves 3, 12, and 5. 

Uhat now? 12 


v Uusor’s behavior is highly dependent on the current Student Model, 
and the following scenarios have been designed with a successively more 
advanced Student Model. In fact, the Student Model would not be 
advancing in such a manner because of the consistent mistakes by the 
player. 
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Ue are now at cave 12. 

The neighboring caves are cave9 1, 8, 18, and 6. 

Squeak. I hear bats they must be in one of the neighboring 
caves. 

Uhat now? 1_ 

Figure E.1 



Jim, did you know that we can backtrack to caves that we 
have already visited? No 

Ue1 I we can, so why not go back to cave 6 where we didn’t 
get any warnings at all. From there we can safely go to cave 3 
without risking bats. Ue should avoid bats because they could 
drop us in a cave with a pit or the Uumpus. 

Would you like to go cave 1 anyway? No 

We are now at cave 12. 

The neighboring caves are caves 1, 8, 18, and S. 

Squeak. 1 hear bats. 

What now? 6 

We are now at cave S. 

The neighboring caves are caves 3, 12, and 5. 

What now? 3 

We are now at cave 3. 

The neighboring caves are caves 2, 6, 5, and 17. 

Brrrr. I feel a breeze, we must be next to a pit. 

Whew, what a stench! That is the smell of the Uumpus. It 
means that we are within two caves of the Uumpus. 

Uhat now? Shoot 
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Figure E.2 



Into which cave would you like to shoot? 17 

Cave 17 can not contain the Wumpus because, if the Uumpus 
were there we would have smelled him in cave G as cave G is 
within two caves of cave 17, If we shoot an arrow into cave 17, 
it could ricochet back and kill us. 

Uould you like to shoot into cave 17 anyway? Yes 

/TV Poing 

Poing 
Po i ng 

Ue are now at cave 3. 

The neighboring caves are caves 2, G, 5, and 17. 

Brrrr. I feel a breeze. 

What a stench! The Uumpus is near. 

What now? 17 

Hummm. 

Jim, we don’t need to risk a pit in cave 17 as cave 5 is 
safe. Cave 5 can not contain a pit because, if a pit were there, 
we would have felt a breeze in cave G. Ue should try not to 
stumble into a pit as it would be fatal. 

Uould you like to go to cave 17 anyway? No 

Ue are now at cave 3. 

The neighboring caves are caves 2, 6, 5, and 17. 

Brrrr. 

Uhat a stench! 

Uhat now? 5 

Ue are now at cave 5. 

The neighboring caves are caves 15, G, 9, and 3. 
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Brrrr. 

Uhat now? 9 



Uould you like to go cave 9 anyway? No 


Ue are now at cave 5. 

The neighboring caves are caves 15, G, 9, and 3, 

Brrrr. 

Uhat now? Route 

Uhat cave would you I ike to go to? 18 

To get to cave 18, Jim, we could move to caves S, 12, and 18. 
Uould you like a route to another cave? No 
Ue are now at cave 5. 

The neighboring caves are caves 15, G, 9, and 3. 

Brrrr. 

Uhat now? 6 

Ue are now at cave G. 

The neighboring caves are caves 3, 12, and 5. 

Uhat now? 12 

Ue are now at cave 12. 

The neighboring caves are caves 1, 8, 18, and G. 

Squeak. 
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Uhat now? IS 


This game would continue along the same lines. We will shift to a more 
advanced player. 

He Ilo, my name is Uusor VII. What is your name? Brian CArr 
Do I have your name right, Brian Carr? Yes 

I believe that we have hunted Wumpii together before. Is that 
right? Y 

Just a second while I erase my blackboard. 

In this game, Brian, there will be 20 caves all total, three 
pits, and three caves with bats. 

O.K., now I will draw up a new warren. 

We are now at cave 8. 

The neighboring caves are caves 6, 9, 13, and 10. 

Brrrr. 

Uhat now? G 

We are now at cave 6. 

The neighboring caves are caves 8, 5, and 0. 

Brrrr. 

Uhat now? 0 

Figure E.4 


0 10 



Brian, It ien’t necessary to take euch large risks with 
pits. It is true that one of caves 13, 9, and 10 contains a 
pit because we felt a breeze in cave 8. However, one of caves 


Page 103 










The Wumpus Advisor 


Appendices 


0 and 5 contains a pit because we felt a breeze in cave G. 
This makes it less likely that cave 9 contains a pit, and so, 
Brian, we might want to explore cave 9 Instead. 

Would you like to go cave 0 anyway? No 

He are now at cave G. 

The neighboring caves are caves 8, 5, and 0. 

Brrrr. 

What now? 8 

We are now at cave 8. 

The neighboring caves are caves G, 9, 13, and 10. 

Brrrr. 

I 

What now? 9 

We are now at cave 9. 

The neighboring caves are caves 4, 13, 8, and 7. 

Brrrr. 

What now? 13 

Figure E.5 



Hummm. 

Brian, it isn’t necessary to take such large risks with 
pits. One of caves 10 and 13 contain a pit. Also, one of 
caves 7, 4, and 13 contains a pit. This is multiple evidence 
of a pit in cave 13 which makes it quite likely that cave 13 
contains a pit. It is less likely that cave 4 contains a pit. 
Hence, Brian, we might want to explore cave 4 instead. 

Would you like to go cave 13 anyway? No 

We are now at cave 9. 

The neighboring caves are caves 4, 13, 8, and 7. 

Brrrr. 
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What nou? Info 

What cave would you like to know about? 8 

The neighbors of cave 8 are caves 6, 9, 13, and 10. 
It has warning "Brrrr.". 

Would you like to know about another cave? No 
We are now at cave 9. 

The neighboring caves are caves 4, 13, 8, and 7. 
Brrrr. 

What now? 4 

We are now at cave 4. 

The neighboring caves are caves 7, 3, and 9. 

What a stench! 

What now? 7 


Figure E.6 



Hummm. 

Cave 3 is a very good cave to explore, Brian. Cave 3 must 
be next to the Wumpus because we smelled the Wumpus in cave 4. 
We know that cave 4 is not next to the Wumpus, and so cave 4 
is two caves away from the Wumpus. One of caves 2, 16, and 0 
must be next to the Wumpus, but we also know that caves 7 and 
9 are not next to the Wumpus. Therefore, cave 3 must be next 
to the Wumpus, and, if we visit cave 8, we will gain 
information about the location of the Wumpus. 

Would you like to go cave 7 anyway? Yes 

We are now at cave 7. 

The neighboring caves are caves 17, 4, and 9. 

Squeak. 
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What now? 4 

We are now at cave 4. 

The neighboring caves are caves 7, 3, and 3. 
What a stench! 

What now? 3 


We are now at cave 3. 

The neighboring caves are caves 4, 12, 10, and 18. 
What a stench! 

What now? 18 



As it is quite possible that cave 18 contains the Wumpus, 
Brian, you might want to shoot an arrow into cave 18 before we 
visit it. 


Would you like to shoot an arrow into cave 18 instead? Yes 

Congratulations, Brian, you have shot the Wumpus. 

Your record is eight wins and four losses. 

Would you like to play again? No 


Weil, Brian, have a nice day. Please look me up next time 
you want to go on a Wumpii hunt. 


I am returning you to DDT.... 
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Results of Expen i merits With the Synthetic Student 

Appendix F 

The following are the "results" of the Synthetic Student 
experiments. The complete protocols are, in fact, much more revealing, 
but altogether amount to well over a thousand pages of computer output. 
An editted portion of the last experiment is shown in the following 
Appendix. This protocol is repre9entat i ve of ali of the protocols 
(except that Syndi normally had a fixed state and did not "learn"). In 
analyzing the tables, the actual numbers shown are of little importance. 
The key variable is whether or not the given rule is known (as indicated 
by an asterisk following the Familiarity Values). The first group of 
experiments were of the Type 1 described earlier. 


ule 

Experiment with Syndi Possessing 1 

Uusor’s Student Knowledge Model 

Bats Pi ts Wumpus 

: ull Knowledge 

Syndi’e Student 
Bats Pits 

Mode 1 

Wumpue 

0 

6.7830712* 

2.78307113* 

0.0- 

0.0* 

0.0* 

0.0* 

1 

386.78307* 

278.78307* 

30S.78307* 

0. 0* 

0. 0* 

0. 0* 

2 

0.0 

0.0 

4.7830712* 

0. 0* 

0.0* 

0. 0* 

3 

79.78307* 

80.78307* 

100.78307* 

0.0* 

0.0* 

0. 0* 

4 

27.7830712* 

27.7830712* 

62.7830715* 

0.0* 

0.0* 

0. 0* 

5 

79.78307* 

80.78307* 

100.78307* 

0.0* 

0. 0* 

0. 0* 

G 

0.0- 

0.0- 

9.78307116* 

0.0* 

0.0* 

0. 0* 

7 

22.7830712* 

27.7830712* 

62.7830715* 

0.0* 

0. 0* 

0. 0* 

8 

0.0 

0.0 

31.7830712* 

0.0* 

0.0* 

0.0* 

9 

0.0 

0.0- 

0.0- 

0.0* 

0. 0* 

0. 0* 

10 

8.78307127* 

8.78307127* 

4.7830712* 

0.0* 

0. 0* 

0. 0* 

11 

18.7830712* 

18.7830712* 

24.7830712* 

0.0* 

0.0* 

0.0* 

12 

3.78307113* 

2.78307113* 

0.0- 

0.0* 

0.0* 

0. 0* 

13 

5.7830712* 

3.78307113* 

7.7830712* 

0.0* 

0. 0* 

0.0* 

14 

2.78307113* 

2.78307113* 

0.0 

0.0* 

0.0* 

0. 0* 

IS 

0.0 

0.0 

25.7830712* 

0.0* 

0.0* 

0.0* 

19 

0.0 

0.0 

29.7830712* 

0.0* 

0.0* 

0.0* 
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Experiment with Syndi Possessing Knowledge up to Phase 2 


Rule 

Wusor’s 

Bats 

Student Knowledge Model 

Pits Wumpus 

Syndi*s 
Bats 

Student 
Pi ts 

Model 

Wumpus 

0 

0.0- 

0.0- 

0.0- 

1.0* 

1.0* 

1.0* 

1 

437.78307* 

23S.78307* 

235.78307* 

11.0* 

11.0* 

11.0* 

2 

0.0 

0.0 

0.0- 

0.0 

0.0 

1.0* 

3 

27.7830712* 

33.7830715* 

8.78307127* 

1.0* 

1.0* 

1.0* 

4 

1.0- 

1.0- 

3.78307113* 

0.0 

0.0 

0.0 

5 

27.7830712* 

33.7830715* 

8.78307127* 

1.0* 

1.0* 

1.0* 

G 

0.0 

0.0 

0.0- 

0.0 

0.0 

0.0 

7 

1.0- 

1.0- 

3.78307113* 

0.0 

0.0 

0.0 

8 

0.0 

0.0 

0.0- 

0.0 

0.0 

0.0 

9 

0.0 

0.0 

0.0- 

0.0 

0.0 

0.0 

10 

0.0 

0.0 

0.0- 

0.0 

0.0 

0.0 

11 

1.0- 

1.0- 

0.0- 

0.0 

0.0 

0.0 

12 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

13 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

14 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

15 

0.0 

0.0 

0.0- 

0.0 

0.0 

0.0 

13 

0.0 

0.0 

2.78307113* 

0.0 

0.0 

1.0* 


Experiment uith Syndi Possessing Knowledge up to Phase 3 


Rule 

Wusor’s Student Knowledge Model 

Bats Pits Wumpus 

Syndi 

Bat9 

*8 Student 
Pits 

Mode 1 
Wumpus 

0 

2.78307113* 

2.78307113* 

0.0- 

1.0* 

1.0* 

0.0 

1 

130.78307* 

32.78307* 

95.78307* 

34.0* 

34.8* 

34.0* 

2 

0.0 

0.0 

4.7830712* 

0.0 

0.0 

2.0* 

3 

21.7830712* 

24.7830712* 

21.7830712* 

1.0* 

1.0* 

5.0* 

4 

8.78307127* 

8.78307127* 

15.7830713* 

2.0* 

2.0* 

4.0* 

5 

21.7830712* 

24.7830712* 

21.7830712* 

1.0* 

1.0* 

5.0* 

S 

0.0- 

0.0- 

4.7830712* 

0.0 

0.0 

0.0 

7 

3,78307113* 

8.78307127* 

15.7830713* 

2.0* 

2.0* 

4.0* 

8 

0.0 

0.0 

8.78307127* 

0.0 

0.0 

0.0 

9 

0.0 

0.0- 

0.0- 

0.0 

0.0 

0.0 

10 

3.78307113* 

3.78307113* 

1.0- 

2.0* 

2.0* 

0.0 

11 

6.7830712* 

6.7830712* 

8.78307127* 

1.0* 

1.0* 

0.0 

12 

1.0- 

1.0- 

0.0- 

0.0 

0.0 

0.0 

13 

1.0- 

1.0- 

0.0- 

0.0 

0.0 

0.0 

14 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

15 

0.0 

0.0 

8.78307127* 

0.0 

0.0 

0.0 

19 

0.0 

0.0 

8.78307127* 

0.0 

0.0 

3.0* 
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The next set of experiments were of the Type 2 with Wusor initially 
assuming that Syndi had a complete knowledge of the game. Because of the 
nature of the Phases, once the player is no longer presumed to know a 
simple rule (like rules 3 and 5), the higher rules can no longer be 
degraded. In any case, the player wa9 reduced to the appropriate phase, 
and advice would only be given at the appropriate level. 


Experiment With Wusor Degrading his Student Model 


Ru 1 e 

Uusor’s 

Bats 

Student Knowledge Model 

Pits Wumpus 

Syndi’e 
Bat9 

Student Model 

Pits Wumpus 

0 

0.84- 

1.0*+ 

0.0 

1.0* 

1.0* 


1 

120.0*+ 

112.0*+ 

113.0*+ 

1.0* 

1.0* 


2 

0.0 

0.0 

0.0- 

0.0 

0.0 

1.0* 

3 

0.34-+ 

0.34-+ 

0.34-+ 

0.0 

0.0 

0.0 

4 

3.0*+ 

2.0*+ 

3.0*+ 

0.0 

0.0 

0.0 

5 

0.34-+ 

0.0-+ 

0.34-+ 

0.0 

0.0 

0.0 

G 

0.0- 

0.0 

0.0 

0.0 

0.0 

0.0 

7 

3.0*+ 

2.0*+ 

3.0*+ 

0.0 

0.0 

0.0 

8 

0.0 

0.0 

2.0*+ 

0.0 

0.0 

0.0 

9 

0.0 

0.0 

1.0* 

0.0 

0.0 

0.0 

10 

1.0*+ 

1.0*+ 

0.0 

0.0 

0.0 

0.0 

11 

3.0*+ 

2.0*+ 

0.0 + 

0.0 

0.0 

0.0 

12 

1.0*+ 

1.0*+ 

0.0 + 

0.0 

0.0 

0.0 

13 

1.0*+ 

0.0 + 

1.0*+ 

0.0 

0.0 

0.0 

14 

15 
19 

1.0*+ 

0.0 

0.0 

0.34 + 

0.0 

0.0 

0.0 

0.0 + 

0.0-+ 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 


r\ 
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The last set of experiments uere of the Type 3, where Syndi started 
with a minimal knowledge of the game (rules 0, 1. and 2) and learned 
rules as they were explained to her# 


Experiment with Syndi Learning 


Rule 

Uusor’s 

Bats 

Student Knowledge Model 

Pits Uumpus 

Syndi's 
Bats 

Student 

Pits 

Model 

Uumpus 

0 

10.0*+ 

4.0* 

0.0- 

3.0* 

1.0* 

1.0* 

1 

517.0*+ 

406.0*+ 

423.0#+ 

6.0# 

4.0* 

29.0* 

2 

0.0 

0.0 

7.0*+ 

0.0 

0.0 

9.0* 

3 

118.0* 

118.0*+ 

68.0*+ 

1.0* 

1.0* 

29.0* 

4 

50.0*+ 

51.0* 

52.0#+ 

3.0* 

3.0* 

27.0* 

5 

118.0* 

118.0*+ 

68* 0#+ 

1.0* 

1.0# 

29.0* 

6 

0.0- 

0.0- 

2.0* 

0.0 

0.0 

0.0 

7 

33.0*+ 

51.0* 

52.0*+ 

4.0* 

3.0* 

32.0* 

8 

0.0 

0.0 

35.0*+ 

0.0 

0.0 

32.0* 

9 

0.0 

0.0- 

0.0- 

0.0 

0.0 

0.0 

10 

13.0*+ 

13.0* 

6.0*+ 

1.0* 

1.0* 

11.0* 

11 

38.0#+ 

39.0* 

28.0*+ 

1.0* 

1.0* 

4.0* 

12 

0.0- 

0.0- 

0.0- 

0.0 

0.0 

0.0 

13 

10.0* 

3.0*+ 

4.0* 

1.0* 

1.0* 

0.0 

14 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

15 

0.0 

0.0 

31.0* 

0.0 

0.0 

0.0 

19 

0.0 

0.0 

22.0*+ 

0.0 

0.0 

0.0 
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Scenario Between Syndi Learn and Wusor 

Appendix G 

What follows is a typical protocol with the Synthetic Student and 

Wusor. At various points each had inserted their current student model 

values into the file, but these were deleted to save space. Also, 

between every move both Wusor and Syndi would output their analyses of 

each move, but most of these have also been deleted to conserve space 

(the remaining are shown in italics). The complete protocols from all the 

experiments are available through the author. 

Comments made by Wusor which would not normally be displayed for 

the student are enclosed in "#*#". They constitute a trace of Wusor's 

reasoning and make Wusor more believable to a human teacher as well as 

serving as an invaluable debugging aid. Syndi’s comments are much less 

frequent and are enclosed in "ttHtt". Syndi’s replies are all listed in 

upper case characters and underlined. 

Hello, my name is Wusor VIII. What is your name? (Please 
type in your first and last name. Follow all responses with 
a space.) SYNDI LEARN 

Do I have your name right, Syndi Learn? YES 

I believe that we have hunted Wumpii together before. Is 
that right? YES 

In this game, Syndi, there will be 20 caves all 
total, one pit, and one cave with bats. 

Just a second while I erase my blackboard. 

*** Time is 1S0. *** 

O.K., now I wiI I draw up a new warren. 

*** I am altering the game to create a situation appropriate 
for a player of phase 0. *** 

We are now at cave 11. 

The neighboring caves are caves 14, 1, 9, and 19. 

What now? 
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The following is my estimation of the moves. 


Cave 

Cost 

Gain 

Index 

19 

0.198270723 

0.80131574 

4.0415233 

9 

0.198270723 

0.80131574 

4.0415233 

1 

0.198270723 

0.80131574 

4.0415233 

14 

0.198270723 

0.80131574 

4.0415233 

,4 

after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting 

Cave 

Cost 

Gain 

Index 

19 

0.0 

1.0 

10000.0 

9 

0.0 

1.0 

10000.0 

1 

0.0 

1.0 

10000.0 

14 

14 

0.0 

1.0 

10000.0 

We are 

now at cave : 

14. 



The neighboring caves are caves 16, 5, 19, and 11. 
Brrrr. I feel a breeze! We must be next to a pit. 


What nou? 


Mi? The 

following is 

my estimation of the moves. 


Cave 


Cost 

Gain 

Index 



9 


0.198270723 

0.79707435 

4.0201313 



1 


0.198270723 

0.79707435 

4.0201313 



19 


0.198270723 

0.79707435 

4.0201313 



5 


0.198270723 

0.79707435 

4.0201313 



16 


0.198270723 

0.79707435 

4.0201313 




A 

after the cave number 

indicates that 

these 

figures 

reflect shooting into the 

cave before 

visiting 

Cave 


Cost 

Gain 

Index 



5 


0.5 

0.5 

1.0 



16 


0.5 

0.5 

1.0 



9 


0.0 

1.0 

10000.0 



1 


0.0 

1.0 

10000.0 



19 


0.0 

1.0 

10000.0 



16 








Syndi, why not 

go back to cave 11 where 

we 

d i dn ’ 


get any warnings at all. From there we can safely go 
cave 1 without risking the pit. We should try not to 
stumble into the pit as it would be fatal. 

*#* I have used: (rule 1 for the Wumpus), (rule 1 for 
pit), and (rule 1 for the bats) in explaining combinat 
rule 8. **# 

Would you like to go to cave 16 anyway? NO 
We are now at cave 14. 

The neighboring caves are caves 16, 5, 19, and 11. 
Brrrr. I feel a breeze. 

What now? 11 


Appendices 


it. 


it. 


t 

to 

the 
i on 
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*** I have stopped initializing the player’s knowledge model 
for the bats because he seems to be learning rule 1. *** 

*** I am advancing the student to phase 1 for the bats as he 
has mastered rule 1. *** 

*** I have stopped initializing the player’s knowledge model 
for the pit because he seems to be learning rule 1. *** 

*** I am advancing the student to phase 1 for the pit as he 
has mastered rule 1. *** 

*** I have stopped initializing the player’s knowledge model 
for the Uumpus because he seems to be learning rule 1. 

*** I am advancing the student to phase 1 for the Uumpus as 
he has mastered rule 1. *#* 

Ue are now at cave 11. 

The neighboring caves are caves 14, 1, 9, and 19. 

Uhat now? 1_ 

*** I am upgrading the player’s receptivity because his move 
to cave lisa move that I told him was good very recently. 
*** “ 

Ue are now at cave 1. 

The neighboring caves are caves 0, 15, and 11. 

Uhew, what a stench! That is the smell of the Uumpus. It 
means that we are within two caves of the Uumpus. 

Uhat now? 


### The following is my estimation of the moves. 


Cave 

Cost 

Gain 

Index 

5 

0.198270723 

0.79283295 

3.9987394 

16 

0.198270723 

0.79283295 

3.9987394 

19 

0.198270723 

0.79283295 

3.9987394 

9 

0.198270723 

0.79283296 

3.9987394 

16 * 

0.1 

2.792833 

27.9283297 

0‘ 

0.1 

2.792833 

27.9283297 

A 

""" after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting it 

Cave 

Cost 

Gain 

Index 

6 

0.5 

0.5 

1.0 

16 

0.5 

0.5 

1.0 

19 

0.0 

1.0 

10000.0 

9 

0.0 

1.0 

10000.0 

16 

0.0 

1.1 

11000.0 

0 

SHOOT 

0.0 

1.1 

11000.0 

Into which cave woul 

d you 1 ike to 

shoot? 0 


Cave 0 can not contain the Uumpus because, if the 
Uumpus were there, we would have smelled the Uumpus in cave 
11 as cave 0 is within two caves of cave 11, and, if we 
O' shoot an arrow into cave 0, it could ricochet back and kill 

us. 
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#** I have used: (rules 3 and 5 for the Wumpus) in 
explaining combination rule 9. *** 
tltttt I have just "learned" rule 3 for the Wumpus. tttttt 
UUU I have just "learned" rule 5 for the Wumpus. Uttti 
Would you like to shoot into cave 0 anyway? NO 

We are now at cave 1. 

The neighboring caves are caves 0, IB, and 11. 

What a stench! The Wumpus is near. 

What now? SHOOT 

Into which cave would you like to shoot? 15 
Po i ng 

You have just shot yourself. 

Would you like to take back your last move? YES 


We are now at cave 1. 

The neighboring caves are caves 0, 15, and 11. 

What a stench! 

What now? 15^ 

We are now at cave 15. 

The neighboring caves are caves 1, G, 3, and 10. 

Squeak. I hear bats, they must be in one of the neighboring 
caves. 

What a stench! 


What now? 


The 

Cave 

5 

16 

19 

9 

0 

10 * 

3 m 

6 " 

A 

figures 


following is my estimation of the moves. 
Cost Gain Index 


0.198270723 

0.198270723 

0.198270723 

0.198270723 

0.198270723 

0.1 

0.1 


0.78859156 

0.78859156 

0.78859156 

0.78859156 

0.78859156 

2.78859156 

2.78859156 

2.78859156 


0.1 

" x " after the cave number 
reflect shooting into the 


3.9773475 

3.9773475 

3.9773475 

3.9773475 

3.9773475 

27.8859157 

27.8859157 

27.8859157 

indicates that these 

cave before visiting 


Cave 

Cost 

Gain 

Index 

5 

0.6 

0.5 

1.0 

16 

0.5 

0.5 

1.0 

10 

0.227228693 

0.7828529 

3.4452203 

3 

0.227228693 

0.7828529 

3.4452203 

6 

0.227228693 

0.7828529 

3.4452203 

19 

0.0 

1.0 

10000.0 

9 

0.0 

1.0 

10000.0 

0 

0.0 

1.1 

11000.0 


it. 
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SHOOT 

Into which cave would you like to shoot? 8 

Congratulations, Syndi, you have shot the Uumpus. 

Your record is one wins and one losses. 

Would you like to play again? YES 

In this game, Syndi, there will be 20 caves all 
total, two pits, and two caves with bats. 


Just a second while I 
*** Time is 149. *** 
O.K., now I will draw 
*** I am altering the 
for a player of phase 


erase my blackboard, 
up a new warren. 

game to create a situation appropriate 

0 . *** 


We are now at cave 3. 

The neighboring caves are caves 7, 0, and 15. 


What now? 7 


We are now at cave 7. 

The neighboring caves are caves S, 3, 13, and 18. 
Squeak. I hear bats. 

Brrrr. 


What now? 


The following is my estimation of the moves. 


Cave 

Cost 

Gain 

Index 

0 

0.114270098 

0.88430289 

7.7387077 

15 

0.114270098 

0.88430289 

7.7387077 

18 

0.114270098 

0.88430289 

7.7387077 

13 

0.114270098 

0.88430289 

7.7387077 

6 

0.114270098 

0.88430289 

7.7387077 

A 

after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting it. 

Cave 

Cost 

Gain 

Index 

18 

0.36856813 

0.6279084 

1.70364266 

13 

0.36856813 

0.6279084 

1.70364266 

6 

0.36856813 

0.6279084 

1.70364266 

0 

0.0 

1.0 

10000.0 

15 

6 

0.0 

1.0 

10000.0 

Humrnmm. 





*** Better moves which I have not mentioned to the student 
recently are caves 15 and 0. *** 

Syndi, we don't need to risk bats and pits in cave 6 
as cave 15 is safe. Cave 15 can not contain a pit because, 
if a pit were there, we would have felt a breeze in cave 3. 
We should avoid pits as they are fatal. Likewise, cave 15 
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is safe from the other dangers. 

##* I have used: (rules 3 and 5 for pits) in explaining 
combination rule 0. *## 

UUU I have just "learned" rule 3 for pits. UUU 

UUU I am transferring my knowledge of rule 3 for pits to 

bats. UUU 

UUU I have just "learned" rule 5 for pits. UUU 

UUU I am transferring my knowledge of rule 5 for pits to 

bats. UUU 

Would you like to go to cave S anyway? NO 
We are now at cave 7. 

The neighboring caves are caves G, 3, 13, and 18. 

Squeak. 

Brrrr. 


What now? 3 


We are now at cave 3. 

The neighboring caves are caves 7, 15, and 0. 

What now? 15 

*** I am upgrading the player’s receptivity because his move 
to cave 15 is a move that I told him was good very recently. 
*** 

We are now at cave 15. 

The neighboring caves are caves 1, 4, 3, and 8. 


What now? 


#/# The following is my estimation 
Cave 
18 
18 
6 


0 
8 
4 
1 

A 

figures 

Cave 

18 

13 

6 

0 

8 

4 

1 

1 


Gain 

0.87938217 
0.87938217 
0.87938217 
1.0 
1.0 
1.0 
1.0 

after the cave number 
reflect shooting into the 
Cost Gain 

0.36856813 0.61575847 


Cost 

0.114270098 

0.114270098 

0.114270098 

0.0 

0.0 

0.0 

0.0 


0.36856813 

0.35856813 

0.0 

0.0 

0.0 

0.0 


0.61575847 

0.61575847 

1.0 

1.0 

1.0 

1.0 


of the moves. 

Index 

7.6956455 

7.6956455 

7.6956455 

10000.0 

10000.0 

10000.0 

10000.0 

indicates that these 
cave before visiting it. 
Index 

1.67067747 
1.67067747 
1.67067747 
10000.0 
10000.0 
10000.0 
10000.0 


We are now at cave 1. 
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The neighboring caves are caves 15, 2, 14, and 9. 

Squeak. 

What a stench! 

Uhat now? 15 

We are now at cave 15. 

The neighboring caves are caves 1, 4, 3, and 8. 

What now? 4 

*** rioves which the student seems to have correctly 
identified as worse are caves 2, 14, and 9. *** 

*** According to combination rule 0, I am marking rules 3 
and 5 for bats because his move is better than cave 2. #** 
Ue are now at cave 4. 

The neighboring caves are caves 15, 11, 18, and 2. 

Squeak. 

Brrrr. 

Uhat now? 1E> 

Ue are now at cave 15. 

The neighboring caves are caves 1, 4, 3, and 8. 

Uhat now? 8 

*** Moves which the student seems to have correctly 
identified as uorse are caves 2, 11, and 18. *** 

*** According to combination rule 0, I am marking rules 3 
and 5 for bats because his move is better than cave 2. *** 
*** I am advancing the student to phase 2 for bats as he has 
mastered rules 3 and 5. *** 

*** According to combination rule 0, I am marking rules 3 
and 5 for pits because his move is batter than cave 11. 

*** 

*** I am advancing the student to phase 2 for pits as he has 
mastered rules 3 and 5. *** 

Ue are now at cave 8. 

The neighboring caves are caves 15, 19, 13, and 2. 

Brrrr. 

Uhat now? 2 

Ue are now at cave 2. 

The neighboring caves are caves 9, 1, 8, and 4. 

Uhat a stench! 

Uhat Now? 9 

Ue are now at cave 9. 

The neighboring caves are caves 10, 2, 1, and 5. 

Uhat a stench! 
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What nou? 1_ 

We are now at cave 1. 

The neighboring caves are cave9 15, 2, 14, and 9. 

Squeak. 

What a stench! 

What now? 15 

We are now at cave 15. 

The neighboring cave9 are caves 1, 4, 3, and 8. 

Uhat now? 3 

We are now at cave 3. 

The neighboring caves are caves 7, 15, and 0. 

What now? 0 

*** Moves which the student seems to have correctly 
identified as worse are caves 10, 5, 14, 19, 11, 13, 
and 18. *** 

*** According to combination rule 0, I am marking rules 3 
and 5 for the Wumpus because his move is better than cave 10. 
*** 

**# According to combination rule 0, I am marking rules 3 
and 5 for bats because his move is better than cave 14. 

*** 

*** According to combination rule 0, I am marking rules 3 
and 5 for pits because his move is better than cave 19. 

*** 

We are now at cave 0. 

The neighboring caves are caves 11, IB, and 3. 

What a stench! 

What now? 


### The following is mg estimation of the moves. 


Cave 

Cost 

Gain 

Index 

18 

0.114270098 

0.84986783 

7.4372723 

6 

0.114270098 

0.84986783 

7.4372723 

19 

0.100000001 

0.9 

8.99999988 

18 

0.100000001 

0.9 

8.99999988 

6' 

0.1 

2.9 

29.0000002 

10' 

0.1 

2.9 

29.0000002 

14 

0.0168666626 

0.9442866 

69.666169 

16 

0.0 

1.0 

10000.0 

11 

0.0 

1.0 

10000.0 


A after the cave number indicates that these 
figures reflect shooting into the cave before visiting it. 
Cave Cost Cain Index 

18 1.0 0.0 0.0 

19 0.6 0.6 1.0 
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13 

0.5 

0.5 

14 

0.158656688 

0.442864954 

6 X 

0 .16665 

3.416685 

10 m 

0.16665 

3.416685 

6 

0.0 

1.0 

11 

0.0 

1.0 

16 

11 

0.0 

1.1 


Appendices 


1.0 

2.79310346 

20.6021603 

20.6021603 

10000.0 

10000.0 

11000.0 


Hummmm, 

*** Better moves which I have not mentioned to the etudent 
recently are cave 16. *** 

We are now at cave 11. 

The neighboring caves are caves 0, 4, 17, and 18. 

Squeak. 

Brrrr. 


What now? 0 

We are now at cave 0. 

The neighboring caves are caves 11, 16, and 3. 
What a stench! 

What now? 16 


t *** Moves which the student seems to have correctly 
identified as worse are caves 17 and 18. *** 

We are now at cave 16. 

The neighboring caves are caves 13, 5, 10, and 0. 
Brrrr. 

What a stench! 

What now? 


*** The 

following is 

my estimation of the moves. 

Cave 

Cost 

Gain 

Index 

6 

0.114270098 

0.8400164 

7.3511479 

18 

0.114270098 

0.8400164 

7.3611479 

17 

0.114270098 

0.8400164 

7.3511479 

19 

0.100000001 

0.9 

8.99999988 

13 

0.100000001 

0.9 

8.99999988 

10" 

0.1 

2.9 

29.0000002 

5* 

0.1 

2.9 

29.0000002 

14 

0.0158556625 

0.93335156 

58.865508 

A 

after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting it 

Cave 

Cost 

Gain 

Index 

18 

1.0 

0.0 

0.0 

13 

1.0 

0.0 

0.0 

14 

0.158556588 

0.333515584 

2.1034483 

10 * 

0.16665 

3.416685 

20.5021603 

5" 

0.16665 

3.416685 

20.5021603 

19 

0.0 

1.0 

10000.0 
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6 0.0 1.0 10000.0 

17 0.0 1.0 10000.0 

0 

We are now at cave 0. 

The neighboring caves are caves 11, IS, and 3. 

Uhat a stench! 

Uhat now? 3 

We are now at cave 3. 

The neighboring caves are caves 7, 15, and 0. 

What now? 15 

We are now at cave 15. 

The neighboring caves are caves 1, 4, 3, and 8. 

What now? 1_ 

We are now at cave 1. 

The neighboring caves are caves 15, 2, 14, and 9. 

Squeak. 

What a stench! 

What now? 14 

*** doves which the student seems to have correctly 
identified as worse are caves 13 and 18. #** 

Hummmm. 

*** Better moves which I have not mentioned to the student 
recently are caves 19, 17, 10, and 5. ### 

Bon Voyage! 

We are now at cave 8. 

The neighboring caves are caves 15, 19, 13, and 2. 

Brrrr. 

What now? 


it* The 

following is 

my estimation of the moves. 

Cave 

Cost 

Gain 

Index 

14 

0.158556588 

0.333515584 

2.1034483 

6 

0.114270098 

0.8400164 

7.3511479 

17 

0.114270098 

0.8400164 

7.3511479 

18 

0.114270098 

0.8400164 

7.3511479 

13 

0.100000001 

0.9 

8.99999988 

19 

0.100000001 

0.9 

8.99999988 

5 X 

0.1 

2.9 

29.0000002 

10 m 

0.1 

2.9 

29.0000002 

A 

after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting 

Cave 

Cost 

Gain 

Index 

18 

1.0 

0.0 

0.0 
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13 

14 
5 ■ 
10 - 
6 
17 
19 


1.0 

0.168556688 
0.16665 
0.16665 
0.0 


0.0 


0.0 


0.0 

0.333515584 

3.416685 

3.416685 

1.0 

1.0 

1.0 


0.0 

2.1034483 

20.5021603 

20.5021603 

10000.0 

10000.0 

10000.0 


We are now at cave 2. 

The neighboring caves are caves 9, 1, 8, and 4. 
What a stench! 


What now? 9 

We are now at cave 9. 

The neighboring caves are caves 10, 2, 1, and 5. 
What a stench! 


What now? SHOOT 

Into which cave would you like to shoot? 10 
Poing 

We are now at cave 9. 

The neighboring caves are caves 10, 2, 1, and 5. 
What a stench! 


What now? 



fff The following is 

my estimation of the moves. 

Cave 

Cost 

Gain 

Index 

14 

0.158556588 

0.333515584 

2.1034483 

6 

0.114270098 

0.8400164 

7.3511479 

17 

0.114270098 

0.8400164 

7.3511479 

18 

0.114270098 

0.8400164 

7.3511479 

13 

0.100000001 

0.9 

8.99999988 

19 

0.100000001 

0.9 

8.99999988 

5 K 

0.1 

2.9 

29.0000002 

10 

0.0 

1.0 

10000.0 

A 

after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting it 

Cave 

Cost 

Gain 

Index 

18 

1.0 

0.0 

0.0 

13 

1.0 

0.0 

0.0 

14 

0.158556588 

0.333516584 

2.1034483 

6 

0.0 

1.0 

10000.0 

17 

0.0 

1.0 

10000.0 

19 

0.0 

1.0 

10000.0 

10 

0.0 

1.0 

10000.0 

5 m 

10 

0.0 

6.1 

61000.0 
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Hummmm. 

*** Better moves which I have not mentioned to the student 
recently are cave 5. #*# 

Ue are now at cave 10. 

The neighboring caves are caves 17, 9, 19, and 16. 

Uhat a stench! 

What now? 17 

Hummmm. 

*** Better moves which I have not mentioned to the stydent 
recently are cave 5. #*# 

Ue are now at cave 17. 

The neighboring caves are caves 6, 10, 12, and 11. 

What a stench! 

What now? 6 

*** I am degrading the player’s receptivity because his move 
to cave 6 is a move that I told him was bad very recently. 
*** 

Ue are now at cave 6. 

The neighboring caves are caves 7, 12, 17, and 13. 

Brrrr. 

Uhat a stench! 

Uhat now? 12 
Hummmm. 

*** Better moves which I have not mentioned to the student 
recently are cave 5. #** 

Ue are now at cave 12. 

The neighboring caves are caves 5, 6, 14, and 17. 

Squeak. 

Uhat a stench! 

Uhat now? 17 

Ue are now at cave 17. 

The neighboring caves are caves 6, 10, 12, and 11. 

Uhat a stench! 

Uhat now? 10 

Ue are nou at cave 10. 

The neighboring caves are caves 17, 9, 19, and 16. 

What a stench! 

Uhat now? 19 

*** Moves which the student seems to have correctly 
identified as worse are caves 14 and 13. *** 

**# According to combination rule 0, I am marking rules 3 
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and 5 for bats because his move is better than cave 14. 

*** 

*** According to combination rule 0, I am marking rules 3 
and 5 for pits because his move is better than cave 13. 

*** 

Hummmm. 

*** Better moves which I have not mentioned to the 9 tudent 
recently are cave 5. #** 

We are now at cave IS. 

The neighboring caves are caves 14, 8, 18, and 18. 

Squeak. 

Brrrr. 

What a stench! 

What now? 


#** The 

following is 

my estimation of the moves. 

Cave 

Cost 

Gain 

Index 

14 

0.158556588 

0.060142154 

0.37931034 

18 

0.114270098 

0.81541279 

7.135837 

13 

0.100000001 

0.9 

8.99999988 

5 m 

0.1 

2.9 

29.0000002 

A 

""" after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting it 

Cave 

Cost 

Gain 

Index 

13 

1.0 

0.0 

0.0 

18 

1.0 

0.0 

0.0 

14 

0.158556588 

0.060142154 

0.37931034 

5 m 

0.0 

6.1 

61000.0 


10 


He are now at cave 10. 

The neighboring caves are caves 17, 9, 19, and 16. 
What a stench! 

What now? 16 

We are now at cave 16. 

The neighboring caves are caves 13, 5, 10, and 0. 
Brrrr. 

What a stench! 

What now? SHOOT 

Into which cave uould you like to shoot? 5 

Congratulations, Syndi, you have shot the Uumpue. 

Your record is two wins and one losses. 

Would you like to play again? YES 

In this game, Syndi, there will be 20 caves all 
total, two pits, and two caves with bats. 
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Just a second while I 
#*# Time is 208. *** 
O.K., nou I will draw 
*** I am altering the 
for a player of phase 


erase my blackboard, 
up a new warren. 

game to create a situation appropriate 

0 . *** 


We are now at cave 5. 

The neighboring caves are caves 8, 12, 3, and 19. 

Uhat now? 8 

We are now at cave 8. 

The neighboring caves are caves 5, 4, S, and 7. 
Squeak. 

Srrrr. 

Uhat a stench! 

Uhat now? 


W The 
Cave 
7 
6 
4 
19 
12 

3 

A 

figures 

Cave 

7 

6 

4 
19 
12 
3 

5 


following is mg estimation of the moves. 
Cost Gain Index 


0.114270098 

0.114270098 

0.114270098 

0.0 


0.88430289 

0.88430289 

0.88430289 

1.0 


0.0 1.0 

0.0 1.0 

""" after the cave number 
reflect shooting into the 
Cost Gain 


0.36856813 

0.36856813 

0.36856813 


0.67235283 

0.67235283 

0.67235283 


0.0 1.0 


0.0 1.0 

0.0 1.0 


7.7387077 

7.7387077 

7.7387077 

10000.0 

10000.0 

10000.0 

indicates that these 
cave before visiting it. 
Index 

1.82422945 
1.82422945 
1.82422945 
10000.0 
10000.0 
10000.0 


Ue are now at cave 5. 

The neighboring caves are caves 8, 3, 12, and 19. 


Uhat now? 3 


#** Moves which the student seems to have correctly 
identified as worse are caves 4, S, and 7. *** 

Ue are now at cave 3. 

The neighboring caves are caves 18, 9, and 5. 

What now? 18. 

Ue are now at cave 18. 

The neighboring caves are caves 3, 17, and 14. 
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What now? 17 

Ue are now at cave 17. 

The neighboring caves are caves 15, 18, 11, and 14, 
Brrrr, 

What now? 


### The 

following is 

my estimation of the moves. 

Cave 

Cost 

Gain 

Index 

7 

0.114270098 

0.86954073 

7.6095211 

6 

0.114270098 

0.86954073 

7.6095211 

4 

0.114270098 

0.86954073 

7.6095211 

11 

0.100000001 

0.9 

8.99999988 

15 

0.100000001 

0.9 

8.99999988 

19 

0.0 

1.0 

10000.0 

12 

0.0 

1.0 

10000.0 

9 

0.0 

1.0 

10000.0 

14 

0.0 

1.0 

10000.0 

A 

""" after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting 

Cave 

Cost 

Gain 

Index 

11 

0.5 

0.5 

1.0 

15 

0.5 

0.5 

1.0 

7 

0.36856813 

0.63590305 

1.7253338 

6 

0.36856813 

0.63590305 

1.7253338 

4 

0.36856813 

0.63590305 

1.7253338 

19 

0.0 

1.0 

10000.0 

12 

0.0 

1.0 

10000.0 

9 

0.0 

1.0 

10000.0 

14 

0.0 

1.0 

10000.0 


14 


Ue are now at cave 14. 

The neighboring caves are caves 0, 11, 18, and 17. 
Squeak. 

Brrrr. 


What now? 1£ 

We are now at cave 18. 

The neighboring caves are caves 3, 17, and 14. 


What now? 3 


We are now at cave 3. 

The neighboring caves are caves 18, 9, and 5. 

What now? 9 

*** Moves which the student seems to have correctly 
identified as worse are caves 0 and 11. *** 

>Mc# According to combination rule 0, I am marking rules 3 
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and 5 for bats because his move is better than cave 0. *** 
*** According to combination rule 0, I am marking rules 3 
and 5 for pits because his move is better than cave 11. 

*** 

Ue are now at cave 9. 

The neighboring caves are caves 3, 6, 10, and 2. 

Uhat a stench! 

Uhat now? 6 

Ue are now at cave G. 

The neighboring caves are caves 8, 9, 11, and 1. 

Brrrr. 

What a stench! 

Uhat now? 9 

We are now at cave 9. 

The neighboring caves are caves 3, G, 10, and 2. 

Uhat a stench! 

Uhat now? 10 

*** Moves which the student 9eems to have correctly 
identified as worse are caves 1, 4, 7, and 11. **# 

*** According to combination rule 0, I am marking rules 3, 
5, and 19 for the Uumpus because his move i9 better than 
cave 1. *** 

*** I am advancing the student to phase 2 for the Uumpus as 
he has mastered rules 3, 5, and 19. **# 

##* According to combination rule 8, I am marking rules 3 
and 5 for pits because his move is better than cave 11. 

*** 

Ue are nou at cave 10. 

The neighboring caves are caves 9, 2, 15, and 13. 

Uhat a stench! 

Uhat now? 2 

Ue are now at cave 2. 

The neighboring caves are caves 10, 0, IS, and 9. 

Squeak. 

Uhat a stench! 

Uhat now? 10 

Ue are now at cave 10. 

The neighboring caves are caves 9, 2, 15, and 13. 

Uhat a stench! 

Uhat now? 15 

Moves which the student seems to have correctly 
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identified as worse are caves 16, 13, and 3. *** 

*** According to combination rule 0, I am marking rules 3 
and 5 for the Uumpus because his movB is better than cave 16. 
*** 

*** According to combination rule 0, I am marking rules 3 
and 5 for bats because his move is better than cave 0. *** 

Ue are now at cave 15. 

The neighboring caves are caves 17, 13, and 10. 

What now? 





### The following is my estimation of the moves. 


Cave 

Cost 

Gain 

Index 

7 

0.114270098 

0.8400164 

7.3511479 

4 

0.114270098 

0.8400164 

7.3511479 

11 

0.100000001 

0.9 

8.99999988 

1 ' 

0.1 

2.81 

28.1 

16 m 

0.1 

2.8400164 

28.400164 

0 

0.0168566625 

0.93335155 

58.865503 

12 

0.0 

1.0 

10000.0 

13 

0.0 

1.0 

10000.0 

19 

0.0 

1.0 

10000.0 

A 

after the cave number 

indicates that these 

figures 

reflect shooting into the 

cave before visiting 

Cave 

Cost 

Gain 

Index 

11 

1.0 

0.0 

0.0 

7 

0.53963915 

0.358378894 

0.6641084 

4 

0.53963915 

0.358378894 

0.6641084 

0 

0 .158556588 

0.333515584 

2.1034483 

r 

0.161094997 

3.5061289 

21.764356 

16“ 

0.127765 

4.0427918 

31.642405 

12 

0.0 

1.0 

10000.0 

19 

0.0 

1.0 

10000.0 

13 

0.0 

1.1 

11000.0 


13 


Hummmm. 

*** 8etter moves which I have not mentioned to the student 
recently are cave 13. *** > 

Cave 13 is a very good cave to explore, Syndi, as we 
know that one of caves 2 and 13 must be next to the Uumpus. 
Cave 10 must be exactly two caves away from the Uumpus 
because we smelled the Uumpus in cave 10. This means that 
cave 10 is within two caves of the Uumpus. Ue also know 
that cave 10 is not next to the Uumpus. Hence cave 10 is 
exactly two caves away from the Uumpus. Therefore one of 
caves 9, 2, 15, and 13 must be next to the Uumpus, but 
neither of caves 9 and 15 are next to the Uumpus. This 
means that one of caves 2 and 13 must be next to the Uumpus, 
and so if we visit cave 13 we will gain information about 
the location of the Uumpus. 

*** I have used: (rules 8, 4, and 7 for the Uumpus) in 
explaining combination rule 5. *** 
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UUU I have just "learned" rule 8 for the Wumpus. UUU 
UUU I have just "learned" rule 4 for the Uumpus. UUU 
UUU I have just "learned" rule 7 for the Uumpus. UUU 
Would you like to go to cave 19 anyway? 

This experiment continued on for many more games. 
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